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ABSTRACT 


Two  deduction  rules  are  introduced  to  give  streamlined  treatment  to  relations  of  special  importance  in  an 
automated  theorem-proving  system.  These  rules,  the  relation  replacement  and  relation  matching  rules,  gen¬ 
eralize  to  an  arbitrary  binary  relation  the  paramodulation  and  El-resolution  rules,  respectively,  for  equality, 
and  may  operate  within  a  nonclausal  or  clausal  system.  The  new  rules  depend  on  an  extension  of  the  notion 
of  polarity  to  apply  to  subterms  as  well  as  to  subsentences,  with  respect  to  a  given  binary  relation.  The  rules 
allow  us  to  eliminate  troublesome  axioms,  such  as  transitivity  and  monotonicity,  from  the  system;  proofs  are 
shorter  and  more  comprehensible,  and  the  search  space  is  correspondingly  deflated. 


1.  INTRODUCTION 


In  any  theorem-proving  system,  the  task  of  representing  properties  of  objects  is  shared  between  axioms 
and  rules  of  inference.  The  axioms  of  the  system  are  easier  to  introduce  and  modify,  because  they  are 
expressed  in  a  logical  language.  However,  because  axioms  are  declarative  rather  than  imperative,  they  are 
given  no  individual  heuristic  controls.  The  rules  of  inference,  on  the  other  hand,  cannot  be  altered  without 
reprogramming  the  system,  and  they  are  usually  expressed  in  the  system’s  programming  language.  However, 
the  rules  can  be  given  individual  heuristic  controls  and  strategies. 

It  is  customary  to  use  rules  of  inference  to  express  properties  of  the  logical  connectives,  which  are  the 
same  from  one  theory  to  the  next,  and  to  use  axioms  to  express  properties  of  constants,  functions,  and 
relations,  which  may  vary.  It  is  hazardous,  however,  to  express  certain  properties  of  functions  and  relations 
by  axioms.  Some  properties  of  the  equality  relation,  for  example,  are  rarely  represented  axiomatically.  For 
one  thing,  in  a  first-order  system  indefinitely  many  axioms  are  necessary  to  represent  the  substitutivity 
property  of  this  relation,  depending  on  how  many  function  and  relation  symbols  are  in  the  vocabulary  of 
the  theory. 

For  instance,  for  a  binary  function  symbol  f(x,y),  we  must  introduce  two  functional- substitutivity  ax¬ 
ioms, 

if  x  =  y  ,  if  x  =  y 

then  f(x,z)  =  f(y,z)  then  f{z,x)  =  f(z,y), 

and  for  a  binary  predicate  symbol  p(x,  y),  we  must  introduce  two  predicate-substitutivity  axioms, 

if  x  =  y  and  if  x  —  y 

then  if  p(x,z)  then  p{y,z)  then  if  p(z,x)  then  p(z,y). 

An  abbreviated  version  of  this  paper  appears  in  the  proceedings  of  the  Twelfth  International  Colloquium 
on  Automata,  Languages,  and  Programming  (ICALP),  Nafplxon,  Greece,  July  1985. 
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l.  Introduction 


(We  tacitly  quantify  variables  universally  over  the  entire  sentence.)  In  general,  for  each  n-ary  function 
symbol  /( xi,  . . .  ,  xn),  we  introduce  n  functional- sustitutivity  axioms.  Similarly,  for  each  n-ary  predicate 
symbol  p(x i,  . .  . ,  xn),  we  introduce  n  predicate- sub stitutivity  axioms. 

More  importantly,  axioms  for  equality  are  difficult  to  control  strategically,  because  they  have  many 
irrelevant  consequences.  An  axiom  such  as  transitivity, 

if  x  =  y  and  y  —  z 
then  x  =  z, 

will  allow  us  to  derive  logical  consequences  from  any  sentence  mentioning  the  equality  relation.  Few  of  these 
consequences  will  have  any  bearing  on  the  proof. 

In  response  to  this  problem,  some  theorem-proving  researchers  have  paraphrased  their  theories  to  avoid 
explicit  mention  of  the  equality  axiom  (e.g.,  Kowalski  [79]).  Others  have  adopted  special  inference  rules  for 
dealing  with  equality.  In  resolution  systems,  two  equality  rules,  paramodulation  (Wos  and  Robinson  [69]) 
and  E-resolution  (Morris  [69])  have  been  found  to  be  effective.  Variations  of  these  rules  are  used  in  many 
theorem  provers  today  (e.g.,  Boyer  and  Moore  [79],  Digricoli  [83]).  By  a  single  application  of  either  of  these 
rules,  we  can  derive  conclusions  that  would  require  several  steps  if  the  properties  of  equality  were  represented 
axiomatically.  The  proofs  are  markedly  shorter,  and  the  search  spaces  are  even  more  dramatically  compressed 
because  the  axioms  and  intermediate  steps  are  not  required.  Within  their  limited  domain  of  application, 
theorem-proving  systems  using  these  rules  surpass  most  human  beings  in  their  capabilities. 


SPECIAL  RELATIONS 

The  authors  became  involved  in  theorem  proving  because  of  its  application  to  program  synthesis,  the  deriva¬ 
tion  of  a  program  to  meet  a  given  specification.  We  have  been  pursuing  a  deductive  approach  to  this  problem, 
under  which  computer  programming  is  regarded  as  a  theorem-proving  task.  In  the  proofs  required  for  pro¬ 
gram  synthesis,  certain  relations  assume  special  importance.  Again  and  again,  proofs  require  us  to  reason 
not  only  about  the  equality  relation,  but  also  about  the  less-than  relation  <  (over  the  integers  or  reals), 
the  subset  relation  C,  the  sublist  relation  <u»t,  or  the  subtree  relation  <tree ■  To  represent  the  transitivity 
and  other  properties  of  these  relations  axiomatically  leads  to  many  of  the  same  problems  that  were  faced 
in  dealing  with  equality:  the  axioms  apply  almost  everywhere,  spawning  innumerable  consequences  that 
swamp  the  system.  Yet  we  would  not  want  to  implement  a  new  inference  rule  for  each  of  the  relations  we 
find  important. 

Both  the  paramodulation  and  the  E-resolution  rules  are  based  on  the  substitutivity  property  of  equality, 
that  if  two  elements  are  equal  they  may  be  used  interchangeably;  i.e.,  for  any  sentence  P  (x,  y),  the  sentence 

if  x  =  y 

then  if  P{x,  y)  then  P(y,  x) 

is  valid.  Here  P(y,  x)  is  the  result  of  replacing  in  P  (x,  y)  certain  (perhaps  none)  of  the  occurrences  of  x  with 
y,  and  certain  (perhaps  none)  of  the  occurrences  of  y  with  x.  (The  notations  we  use  here  informally  will  be 
defined  systematically  later  on.  We  assume  throughout  that  sentences  are  quantifier-free.) 

We  observe  that  many  of  the  relations  we  regard  as  important  exhibit  substitutivity  properties  similar 
to  the  above  property  of  equality,  but  under  restricted  circumstances.  For  example,  over  the  nonnegative 
integers,  we  can  show  that 

if  x  <y 

then  if  a  <  x  •  b 
then  a  <  y  ■  b 
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and,  over  the  lists,  we  can  show  that 

if  x  <iut  y 
then  if  u  G  x 

then  u  €  y. 

Knowing  that  x  <  y  or  that  x  ^ust  y  does  not  allow  us  to  use  x  and  y  interchangeably,  but  it  does  allow  us 
to  replace  certain  occurrences  of  x  with  y,  and  vice  versa. 

Based  on  such  substitutivity  properties,  we  can  introduce  two  deduction  rules  that  generalize  the 
paramodulation  and  E-resolution  rules  for  equality  to  an  arbitrary  relation,  under  appropriate  circumstances. 
Just  as  the  equality  rules  enable  us  to  drop  the  transitivity  and  substitutivity  axioms  for  equality,  the  new 
relation  rules  enable  us  to  drop  the  corresponding  troublesome  axioms  for  the  relations  of  our  theory. 


POLARITY 


For  the  equality  relation,  knowing  that  x  —  y  allows  us  to  replace  in  a  given  sentence  any  occurrence  of 
x  with  y  and  any  occurrence  of  y  with  x,  obtaining  a  sentence  that  follows  from  the  given  one.  For  an 
arbitrary  binary  relation  -4,  knowing  that  x  -4  y  still  may  allow  us  to  replace  certain  occurrences  of  x  with  y 
and  certain  occurrences  of  y  with  x.  We  describe  a  syntactic  procedure  that,  for  a  given  relation  -4,  identifies 
which  occurrences  of  x  and  y  in  a  given  sentence  can  be  replaced,  provided  we  know  that  x  -4  y. 

More  precisely,  we  identify  particular  occurrences  of  subexpressions  of  a  given  sentence  as  being  positive 
(+),  negative  (— ),  or  both,  or  neither,  with  respect  to  -4.  If  x  -4  y,  positive  occurrences  of  x  can  be 
replaced  with  y,  and  negative  occurrences  of  y  can  be  replaced  with  x.  In  other  words,  we  can  establish  the 
substitutivity  property  that,  for  any  sentence  P{x+,  y~),  the  sentence 

if  x -4  y 

then  if  P(x+ ,  y~)  then  P(y+ ,  x~) 

is  valid  (over  the  theory  in  question).  Here  P(y+ ,  x~)  is  the  sentence  obtained  from  P(x+ ,  y~)  by  replacing 
certain  positive  occurrences  of  x  with  y  and  certain  negative  occurrences  of  y  with  x.  With  respect  to  the 
equality  relation,  every  subexpression  is  both  positive  and  negative;  therefore,  if  we  take  -4  to  be  =,  this 
property  reduces  to  the  substitutivity  of  equality. 

Our  new  rules  are  based  on  the  above  substitutivity  property  just  as  the  equality  rules  are  based  on  the 
substitutivity  of  equality.  The  new  rules,  like  the  equality  rules,  allow  us  to  perform  in  a  single  application 
inferences  that  would  require  many  steps  in  a  conventional  system.  Proofs  are  shorter  and  closer  to  an 
intuitive  argument;  the  search  space  is  condensed  accordingly. 


NONCLAUSAL  DEDUCTION 


The  paramodulation  and  El-resolution  rules  are  formulated  for  sentences  in  clausal  form  (a  disjunction  of 
atomic  sentences  and  their  negations);  on  the  other  hand,  the  two  corresponding  rules  we  introduce  apply  to 
free-form  sentences,  with  a  full  set  of  logical  connectives  (cf.  Manna  and  Waldinger  [80],  Murray  [82],  Stickel 
[82]).  By  adopting  such  a  nonclausal  system,  we  avoid  the  proliferation  of  sentences  and  the  disintegration 
of  intuition  that  accompany  the  translation  to  clausal  form.  Also,  it  is  awkward  to  express  the  mathematical 
induction  principle  in  a  clausal  system,  because  we  must  do  induction  on  sentences  that  may  require  more 
than  one  clause  to  express.  On  the  other  hand,  our  rules  are  also  immediately  and  directly  applicable  to 
clausal  theorem-proving  systems. 
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2.  Preliminaries 


OUTLINE 

In  the  following  section,  Preliminaries,  we  sketch  the  basic  concepts  of  logic  that  we  use  in  this  paper  and 
we  briefly  outline  a  nonclausal  deduction  system.  Readers  who  are  familiar  with  this  material  should  skim 
the  section  anyway,  to  become  acquainted  with  our  terminology  and  notations. 

In  Relational  Polarity  we  introduce  our  central  notion,  the  polarity  of  a  subexpression  of  a  sentence 
with  respect  to  a  given  relation. 

We  then  describe,  in  The  Relation  Replacement  Rule,  a  new  deduction  rule  that  allows  us  to  replace 
a  subexpression  of  a  sentence  with  another  expression,  under  a  wide  variety  of  circumstances.  This  is  our 
generalization  of  the  paramodulation  rule. 

The  rules  in  our  system  can  be  applied  when  two  subexpressions  can  be  unified.  However,  our  second 
deduction  rule,  described  in  The  Relation  Matching  Rule,  allows  us  to  draw  a  conclusion  even  though 
two  subexpressions  fail  to  unify.  (Typically  this  rule  is  applied  when  the  two  subexpressions  “nearly*  unify.) 
This  is  our  generalization  of  the  E-resolution  rule. 

In  Strengthening  we  tighten  up  our  theory  of  polarity  to  allow  the  relation  replacement  rule  to  draw 
a  stronger  conclusion,  in  many  circumstances. 

In  Extensions,  we  indicate  how  the  notions  in  this  paper  can  be  extended  to  apply  to  sentences 
which  contain  explicit  quantifiers  and  to  define  polarity  with  respect  to  functions  as  well  as  relations;  we 
develop  more  general,  conditional  versions  of  all  the  rules;  and  we  show  how  our  results  apply  to  problems 
in  automated  planning. 


2.  PRELIMINARIES 

Before  we  can  define  our  central  notion,  that  of  polarity  of  a  subexpression  with  respect  to  a  relation,  we 
must  introduce  some  concepts  and  notations.  We  will  be  brief  and  informal,  because  we  believe  that  this 
material  will  be  familiar  to  most  readers. 


EXPRESSIONS 

We  consider  terms  composed  (in  the  usual  way)  of  the  following  symbols: 

•  The  constant  symbols  a,  b,  c,a i,  . . . ,  s,  t,  and  special  constants  such  as  0. 

•  The  variable  symbols  u,  v,w,x,y,Ui, _ 

•  The  n-ary  function  symbols  f,g,h,  fi,  ...  and  special  symbols  such  as  +. 

Thus  a,  x,  f(a,  x),  and  /(a,  x)  +  0  are  terms. 

We  consider  propositions  composed  (in  the  usual  way)  from  terms  and  the  following  symbols: 

•  The  truth  symbols  (logical  constants)  true  and  false. 

•  The  n-ary  relation  symbols  p,  q,  r,  pi,  . . .  and  special  symbols  such  as  =  and  <. 

Thus  true  and  p(a,  g(z))  are  propositions. 

We  consider  sentences  composed  (in  the  usual  way)  from  propositions  and  the  following  symbols: 

•  The  logical  connectives  not,  and,  or,  if-then,  =  ( if-and-only-if ),  if-then-else. 
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Thus  (a  <  0)  or  not(j?[a ,  <7(z)))  is  a  sentence. 

The  operators  consist  of  the  function  and  the  relation  symbols.  The  expressions  consist  of  the  terms  and 
the  sentences.  Note  that  we  do  not  include  the  quantifiers  V  and  3  in  our  language.  The  ground  expressions 
are  those  that  contain  no  variables.  The  expressions  that  occur  in  a  given  expression  are  its  subexpressions. 
They  are  said  to  be  proper  if  they  are  distinct  from  the  entire  expression. 


REPLACEMENT 

We  introduce  the  operation  of  replacing  subexpressions  of  a  given  expression  with  other  expressions.  We 
actually  have  two  distinct  notions  of  replacement,  depending  on  whether  or  not  every  occurrence  of  the 
subexpression  is  to  be  replaced. 

Suppose  s,  t,  and  e  are  expressions,  where  s  and  t  are  either  both  sentences  or  both  terms.  If  we  write 
e  as  e[s],  then  e[t\  denotes  the  expression  obtained  by  replacing  every  occurrence  of  s  in  e[s]  with  t;  we  call 
this  a  total  replacement.  If  we  write  e  as  e(s),  then  e(t)  denotes  the  expression  obtained  by  replacing  certain 
(perhaps  none)  of  the  occurrences  of  s  in  e(s)  with  t\  we  call  this  a  partial  replacement. 

When  we  say  we  replace  certain  (perhaps  none)  of  the  occurrences  of  s,  we  mean  that  we  replace  zero, 
one,  or  more  occurrences.  We  do  not  require  that  e[s]  or  e(s)  actually  contain  any  occurrences  of  s;  if  not,  e[i] 
and  e(t)  are  the  same  as  e[sj  and  e{s),  respectively.  Also,  while  the  result  of  a  total  replacement  is  unique, 
a  partial  replacement  can  produce  any  of  several  expressions. 

For  example,  if  e[s]  is  p(s,  s,  6),  then  e[t\  is  p(t,  t,  b).  On  the  other  hand,  if  e(s)  is  p(s,  s,  6),  then  e(t)  could 
be  any  of  p(s,  s,  b),  p(t,  s,  b),  p[s,  t,  b),  or  p(f,  t,  6).  If  we  want  to  be  more  specific  about  which  occurrences 
are  replaced,  we  must  do  so  in  words. 

A  partial  replacement  is  invertible,  in  the  sense  that  any  sentence  e(s)  can  be  retrieved  by  replacing 
certain  occurrences  of  t  in  e(t)  with  s.  The  occurrences  of  t  to  be  replaced  are  precisely  the  ones  introduced 
in  obtaining  e(t)  in  the  first  place.  For  example,  if  e{s)  is  p[s,s,t),  and  e(t)  is  p{s,t,t),  then  e(s)  can  be 
retrieved  by  replacing  the  newly  introduced  occurrence  of  t  in  e(t)  with  s. 

Total  replacement,  on  the  other  hand,  is  not  invertible  in  the  same  sense.  For  example,  if  e[s]  is  p(s,  s,  t), 
then  e[i]  is  p(t,t,t),  and  e[s]  cannot  be  obtained  from  e[t]  by  replacing  every  occurrence  of  t  in  e[t)  with  s. 


MULTIPLE  REPLACEMENT 

We  can  extend  the  definition  to  allow  the  replacement  of  several  subexpressions  at  once: 

Suppose  si,  ...,sn,t  1,  ...,tn,  and  e  are  expressions,  where  the  s<  are  distinct  and,  for  each  i,  St¬ 
and  ti  are  either  both  sentences  or  both  terms.  If  we  write  e  as  e[si,  . . .  ,s„],  then  e\ti,  . . .  ,tn]  denotes 
the  expression  obtained  by  replacing  simultaneously  every  occurrence  of  each  expression  s,-  in  e  with  the 
corresponding  expression  i,-;  we  call  this  a  multiple  total  replacement.  If  we  write  e  as  e(si,  . . .  ,s„),  then 
e(ti,  . . .  ,tn)  denotes  any  of  the  expressions  obtained  by  replacing  simultaneously  certain  (perhaps  none)  of 
the  occurrences  of  some  of  the  expressions  st-  in  e  with  the  corresponding  expression  U;  we  call  this  a  multiple 
partial  replacement. 

The  replacements  are  made  simultaneously  in  a  single  stage.  For  example,  if  e[a ,  b )  is  /(a,  b),  then  e\b,  c] 
is  f{b,c).  On  the  other  hand,  if  e(a,  b)  is  /(a,  6),  then  e(b,  c)  could  denote  any  of  f(a,  b),  f(b,b),  /(a,  c),  or 
f(b,  c).  Even  though  a  is  replaced  by  b  and  b  is  replaced  by  c,  the  newly  introduced  occurrences  of  b  are  not 
replaced  by  c. 

The  replacements  are  made  from  the  top  down.  For  example,  if  e[p(a,  b),  a]  is  p[a,b),  then  e[true,  6]  is 
true.  We  replace  both  p(a,  b)  and  a,  but  a  is  a  subexpression  of  p(o,  6).  In  such  cases,  by  convention,  it  is  the 
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outermost  subexpression  that  is  replaced.  (For  the  corresponding  partial  replacement,  either  subexpression 
can  be  replaced.) 

By  attaching  a  numerical  superscript,  we  can  specify  exactly  how  many  subexpression  occurrences  are 
to  be  replaced  in  a  partial  replacement.  Suppose  si,  . . .  ,sn,t i,  . . . ,  tn,  and  e(si,  . . . ,  sn)  are  expressions 
and  A;  is  a  nonnegative  integer,  where  the  s»  are  distinct  and,  for  each  i,  Si  and  t{  are  either  both  sentences 
or  both  terms.  Then  e(ti,  . . .  ,tn)k  is  the  result  of  replacing  in  e(sj ,  ...,s„)  precisely  k  occurrences  of 
«i,  . . .  ,sn  with  the  corresponding  expression  tlt  . . .  ,tn.  [We  assume  that  at  least  k  occurrenes  exist.] 

Note  that  precisely  k  occurrences  are  replaced  altogether.  For  example,  suppose  e{a,b)  is  c  <  f(a,a,b); 
then  e(a  +1,6+  l)2  could  denote  any  of 

c  <  /(a  +  1,  a  +  1, 6),  c  <  f[a  +  1,  a,  b  +  1),  or  c  <  f(a,  a  +  1, 6  +  l), 

but  not 

c  <  /(a  +  1,  a  +  1, 6  +  1)  or  c  <  f[a  +  1,  a,  6). 

We  may  also  write  e{ti,  f2,  . . . ,  tn)k,t  to  indicate  that  precisely  k  or  l  replacements  are  made  in  the  expression 

e(sl>  s2t  •  •  •  )  sn). 


SUBSTITUTIONS 

We  have  a  special  notation  for  a  substitution,  indicating  the  total  replacement  of  variables  with  terms.  A 
theory  of  substitutions  was  developed  by  Robinson  [65],  in  the  paper  in  which  the  resolution  principle  was 
introduced.  A  fuller  exposition  of  this  theory  appears  in  Manna  and  Waldinger  [81], 

For  any  distinct  variables  x\,  x2,  ■ .  ■ ,  xn  and  any  terms  tx,  t2,  . . . ,  tn,  a  substitution 

6:  {zi  i—  ti,  x2  <-  t2,  ...,xn*-tn} 

is  a  set  of  replacement  pairs  X{  <—  t{.  Note  that  there  are  no  substitutions  of  form  {x  +—  a,  x  *—  b,  . . .  }, 
where  a  and  6  are  distinct.  (If  a  and  6  are  identical,  then  the  set  {x  *—  a,  x  <—  a,  . . .  }  is  the  same  as  the  set 
{x  *—  a,  ...  }.)  The  empty  substitution  {  }  is  the  set  of  no  replacement  pairs. 

For  any  substitution  6  and  expression  e,  we  denote  by  ed  the  expression  obtained  by  applying  6  to  e, 
i.e.,  by  simultaneously  replacing  every  occurrence  of  the  variable  x,-  in  e  with  the  expression  t,-,  for  each 
replacement  pair  x,  «—  U  in  9.  We  also  say  that  eS  is  an  instance  of  e.  For  example, 

p(x,y){x  *-  y,  y  <-  a}  =  p{y,a). 

The  empty  substitution  {  }  has  the  property  that  e{  }  =  e  for  any  expression  e. 

Two  substitutions  6  and  A  are  said  to  be  equal  if  they  have  the  same  effect  on  any  expression,  i.e.,  if, 
for  any  expression  e, 

ed  =  eX. 

For  example, 

{x  <—  a,  y  *—  6}  =  {x  <—  a,  y  <—  6,  z  z}. 

Two  substitutions  9  and  A  are  equal  if  they  agree  on  all  variables,  i.e.,  if  x9  =  xA  for  all  variables  x. 

For  any  variable  x,  term  t,  and  substitution  9,  the  result 
(x  «—  t)  o  9 
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of  adding  the  replacement  pair  x  *—  t  to  6  is  defined  to  be  the  substitution  that  replaces  x  with  t  but  agrees 
with  9  on  all  other  variables.  It  is  thus  defined  by  the  properties 

x((x  <—  t)  o  S')  =  t 

y((x  *—  t)  o  d)  =  yd,  for  all  variables  y  distinct  from  x. 

Note  that  6  may  already  replace  x  with  some  term  t'\  if  so,  that  replacement  is  superseded  by  the  new  one. 
For  example, 

(y^-6)o{}  =  {y*-b} 

{x  *—  a)  o  {y  <—  6}  =  {x  <—  a,  y  *-  b} 

{y<-c)o{x<-a,y<-b}  =  {x  <-  a,  y  <-  c} 

(x  <—  x)  o  {x  <—  a}  =  {  }. 

We  write  (x  +—  t)  o  (y  +—  t')  o  6  as  an  abbreviation  for  (x  *—  t)  o  ((y  ♦—  t')  o  0) . 

The  composition  9\  of  two  substitutions  8  and  A  is  defined  by  the  properties 
{}A  =  A 

((i<-t)oj)A  =  (x  ♦—  tX)  o  (0A) 

for  all  variables  x  and  terms  t.  The  most  important  property  of  the  composition  function  is  that  applying 
the  composition  of  two  substitutions  8  and  A  to  an  expression  e  is  the  same  as  applying  first  one  and  then 
the  other;  that  is,  e(8X)  =  (ed)X.  The  empty  substitution  can  be  shown  to  be  an  identity  under  composition; 
that  is,  {  }6  —  8{  }  =  9,  for  all  substitutions  8.  Also,  composition  can  be  shown  to  be  associative;  that  is, 
0(Ap)  =  (8X)p  for  all  substitutions  9,  A,  and  p. 

The  definition  of  composition  suggests  a  way  of  computing  it.  For  example, 

{y  <-  g{z)}{y  *-  *,  z  <-  6}  =  (y  <-  g(b))  O  {y  X,  z  *-  b} 

=  {y*~  ?(&),  z<-b} 

and  therefore 

{x  y,  y  «-  y(z)}{y  X,  z  b}  =  (x  ♦-  x)  o  {y  «-  g(b),  z  *-  6} 

=  {y  *-  a  *-  6}. 

Note  that  the  composition  of  substitutions  is  not  commutative.  For  example,  {x  +—  y}{y  ■*—  x}  = 
{y  +-  x}  and  {y  x}{x  *-  y}  =  {x  «-  y},  but  {y  x}  ^  {x  y}. 

A  substitution  8  is  said  to  be  more  general  than  a  substitution  8'  if  there  exists  a  substitution  A  such  that 
8 X  =  6'.  For  example,  the  substitution  8  :  {x  «—  y}  is  more  general  than  the  substitution  8'  :  {x  *—  a,  y  *—  a}, 
because 

0{y<-a}  =  {x  *-  y}{y  ♦-  a}  =  {x<-a,  y*-a}  =  8'. 

On  the  other  hand,  8  :  {x  <—  y}  is  not  more  general  than  the  substitution  <f>  :  {x  <—  a},  because  there  is  no 
substitution  A  such  that 

0A  =  {x  •*—  y}A  =  {x  <—  a}  =  <f>. 

A  substitution  is  regarded  as  more  general  than  itself,  because  8{  }  =  8  for  any  substitution  8.  It  is 
possible  for  two  distinct  substitutions  to  be  more  general  than  each  other.  For  example,  8  :  {x  *—  y}  and 
;  {y  <—  x}  are  more  general  than  each  other,  because 

8{y^x}  =  {x  —  y}{y<-z}  =  {y  x)  =  8' 
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0'{x<-y}  =  {y  «-  x}{x  +-  y)  =  {x  *-  y}  =  9. 

UNIFIERS 

A  substitution  8  is  said  to  be  a  unifier  of  two  expressions  e  and  e  if 
e$  =  ed, 

that  is,  if  e6  and  ed  axe  identical  expressions.  Two  expressions  are  unifiable  if  they  have  a  unifier. 

For  example,  the  substitution 
8  :  {x  <—  b,  y  z} 
is  a  unifier  of  the  two  expressions 

e:f(x,z)  and  e:f(b,y), 

because  ed  =  ed  =  f(b,z).  Thus,  e  and  e  are  unifiable.  The  substitutions 
4>: 

and 

p  :  {x  <—  b,  y  *—  to,  z  *—  to} 

are  also  unifiers  of  these  two  expressions.  Thus,  unifiers  of  expressions  are  not  unique. 

The  expressions  p(a)  and  p(b)  axe  clearly  not  unifiable  and  neither  are  the  expressions  q(x,  f(x))  and 
q(g{y),  y) .  The  expressions  x  and  f(x)  axe  also  not  unifiable.  Because  x  is  a  proper  subexpression  of  f(x),  we 
know  xd  is  a  proper  subexpression  of  {f{x))d,  for  any  substitution  d;  hence  xd  and  (/(x))  d  are  not  identical. 

A  substitution  d  is  said  to  be  a  most-general  unifier  of  two  expressions  e  and  e  if  d  is  a  unifier  of  e  and  e 
and  if  d  is  more  general  than  any  unifier  of  e  and  e.  For  example,  the  distinct  substitutions  d  :  {x  *—  b,  y  *—  z} 
and  <j>  :  {x  *—  b,  z  *—  y}  axe  both  most  general  unifiers  of  the  expressions  e  :  f(x,  z)  and  e  :  f{b,y).  Thus, 
most-general  unifiers  are  not  unique.  It  is  clear,  however,  that  all  most-general  unifiers  of  two  expressions 
are  equally  general,  i.e.,  each  is  more  general  than  any  of  the  others. 

There  is  a  unification  algorithm  (Robinson  [65])  for  determining  whether  a  given  pair  of  expressions  is 
unifiable  and,  if  so,  for  producing  a  most  general  unifier. 

We  can  extend  the  notion  of  unifier  to  apply  to  a  list  of  pairs  of  expressions.  A  substitution  6  is  said  to 
be  a  simultaneous  unifier  of  the  list 

((ei,e~i),  (e2) 62)1  {en,en)) 

of  pairs  of  expressions  if 

ex 8  =  e\d ,  e^d  =  t2d,  . . . ,  and  end  =  end. 

(Note  that  we  do  not  require  that  e,-0  =  e3d,  for  distinct  i  and  j.)  We  may  also  say  that  d  is  a 
simultaneous  unifier  of  e\  and  el,  of  e2  and  e2,  ...,  and  of  en  and  en.  A  list  of  pairs  of  expressions  is 
simultaneously  unifiable  if  it  has  a  simultaneous  unifier. 

A  list  may  fail  to  be  simultaneously  unifiable  even  though  the  expressions  of  each  pair  it  contains  are 
unifiable  independently.  For  example,  the  list  of  pairs 

((x,g{y)},  (/(z).y)) 
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is  not  simultaneously  unifiable,  even  though  the  expressions  x  and  g(y)  are  unifiable,  by  the  substitution 
{x  *—  y(y)},  and  the  expressions  f(x)  and  y  are  unifiable,  by  the  substitution  {y  *—  /(a:)}. 

For  any  list  of  pairs  of  expressions,  a  simultaneous  unifier  is  most  general  if  it  is  more  general  than  any 
other  simultaneous  unifier. 

We  can  extend  the  notion  of  unifier  further  to  apply  to  a  list  of  lists  of  expressions.  A  substitution  8  is 
said  to  be  a  simultaneous  unifier  of  the  list 

A/  /V  /V 

•••)>  (e2)e2>^2>  •••)>  •••>  (en)en>£n>  •••))> 

of  lists  of  expressions  if 

e\ 9  =  e[ 9  =  e[9  =  ... 
e2&  —  £28  —  £2  8  =  ... 


We  may  also  say  that  8  is  a  simultaneous  unifer  of  ei,£i,e[,  ...,  of  . . and  of  en,en^en . 

The  notion  of  most-general  simultaneous  unifier  and  the  unification  algorithm  may  be  extended  accordingly. 
The  notation  is  more  complex  but  the  concepts  are  the  same. 


SUBSTITUTION  AND  REPLACEMENT 

We  sometimes  find  it  convenient  to  use  the  replacement  and  substitution  notations  together.  Suppose  s,  t, 
and  e  are  expressions,  where  s  and  t  are  either  both  sentences  or  both  terms.  Let  8  be  a  substitution.  If  we 
write  e  as  e[s],  then 

ed\t\ 

denotes  the  expression  obtained  by  replacing  every  occurrence  of  s&  in  ed  with  t.  If  we  write  e  as  e(s),  then 
e8(t) 

denotes  the  expression  obtained  by  replacing  certain  (perhaps  none)  of  the  occurrences  of  sd  in  ed  with  t. 
For  example,  consider  the  expression 

e:  p(f(x,a))  or  q(f(x,y))  or  r(f(b,a)) 
and  the  substitution 

6  :  {x  *—  b,  y  <—  a}. 

If  we  write  e  as  e\f(x,  a)],  then  ed[g[c)]  is 
p(sKc))  or  q(g(c))  or  r(y(c)). 

Note  that  two  of  the  replaced  occurrences  of  f(x ,  a)9  in  ed  do  not  correspond  to  occurrences  of  f(x,  a)  in  e; 
they  were  created  by  application  of  the  substitution  9. 

INTERPRETATIONS 

We  shall  use  the  Herbrand  notion  of  interpretation,  in  which  the  elements  of  the  domain  are  identified  with 
the  terms  of  the  language. 
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An  interpretation,  I  is  an  assignment  of  truth  values,  either  T  (true)  or  F  (false),  to  every  ground 
proposition  (i.e.,  to  every  proposition  that  contains  no  variables).  If  I  assigns  T  [or  F]  to  a  ground  proposition, 
that  proposition  is  said  to  be  true  [or  false ]  under  I.  The  truth  [or  falseness]  of  a  nonpropositional  ground 
sentence  under  an  interpretation  I  may  be  determined  from  that  of  its  propositional  constituents  by  the 
familiar  semantic  rules  for  the  logical  connectives. 

A  nonground  sentence  P  is  true  under  I  if  every  ground  instance  of  P  is  true  under  J ;  otherwise,  P  is 
false  under  1 .  Note  that,  according  to  this  definition,  free  variables  have  a  tacit  universal  quantification. 

We  can  now  define  the  notions  of  implication  and  equivalence  between  sentences.  The  sentences 
Pi,  P'2,  Pz,  ...  imply  a  sentence  Q  if,  for  any  interpretation  I, 

if  Pi i  P2,  P2>  •  •  •  are  all  true  under  I, 
then  <2  is  true  under  I . 

Note  that  if  P  implies  Q,  it  is  not  necessarily  the  case  that  the  sentence  ( if  P  then  Q)  is  valid.  For 
example,  p(x)  implies  p(a),  because  free  variables  are  taken  to  be  universally  quantified.  But  the  sentence 
(if  p(x)  then  p(a))  is  not  valid:  its  instance  (if  p(b)  then  p(a))  is  false  under  any  interpretation  for  which 
p(b)  is  true  and  p(a)  is  false. 

Two  sentences  P  and  Q  are  equivalent  if,  for  any  interpretation  J, 

P  is  true  under  I 
if  and  only  if 
Q  is  true  under  J. 

Hence  P  is  equivalent  to  Q  if  P  implies  Q  and  0  implies  P .  For  example,  the  sentences  p(x)  and  p(y)  are 
equivalent. 

Lemma  (instantiation) 

For  any  sentence  7  and  substitution  9,  7  implies  76. 

Both  total  and  partial  replacement  exhibit  the  following  value  property: 

Suppose  P,  Q,  and  7  are  ground  sentences  and  J  is  an  interpretation.  Then 

if  P  and  Q  have  the  same  truth  value  under  I, 

then  7\P\  and  7\0\  have  the  same  truth  value  under  I. 

Also, 

if  P  and  Q  have  the  same  truth  value  under  I, 

then  7(P)  and  7(0.)  have  the  same  truth  value  under  I. 

A  corresponding  value  property  applies  to  multiple  replacements. 

Remark 

The  value  property  applies  only  to  ground  sentences,  not  to  sentences  with  variables.  For  instance,  let 
P  be  the  sentence  p(x),  let  Q  be  the  sentence  false,  and  let  7\P\  be  the  sentence  (notp(x)).  Consider  an 
interpretation  I  under  which 

p(a)  is  true  and  p(b)  is  false. 

Then  (by  the  definition  of  truth  for  a  nonground  sentence)  p(x)  is  false  under  J  and  hence 
p(x)  and  false  have  the  same  truth  value  under  I. 
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On  the  other  hand  (by  the  definition  again)  notp[x)  is  also  false  under  1  and  hence 
(not  p(x))  and  ( not  false)  do  not  have  the  same  truth  value  under  I, 
contradicting  the  conclusion  of  the  value  property. 

THEORIES 

A  theory  is  a  set  of  sentences  T  that  is  closed  under  logical  implication:  If  T  implies  a  sentence  P  then  P 
belongs  to  T.  A  member  of  a  theory  T  is  also  said  to  be  valid  in  T. 

A  theory  T  is  said  to  be  defined  by  a  set  of  sentences  A  if  T  is  precisely  the  set  of  sentences  implied  by 
A.  We  shall  also  say  that  A  is  a  set  of  axioms  for  T. 

An  interpretation  I  is  said  to  be  a  model  for  a  theory  7  if  every  sentence  of  T  is  true  under  I. 

For  example,  let  T  be  the  set  of  sentences  implied  by  the  transitivity  axiom, 

if  x  <y  and  y  <  z 
then  x  -<  z, 

and  the  irreflexivity  axiom, 
not  x  -<  x. 

Then  7  is  a  theory,  defined  by  these  axioms.  The  asymmetry  property 

if  x  -<y 
then  not  y  <  x 

is  a  (valid)  sentence  of  this  theory. 

RELATIONS 

We  need  some  special  terminology  for  speaking  about  relations.  Henceforth,  let  us  consider  a  particular 
theory.  When  we  speak  of  validity,  we  shall  mean  validity  in  that  theory. 

Let  p  and  q  be  n-ary  relations.  Then  we  say  that  p  implies  q  if 

if  p{x i,x2,  ...,xn)  then  q(x i,x2,  ...,xn) 

is  valid  (in  the  theory  under  discussion).  We  also  say  that  p  is  equivalent  to  g  if 

p(xi,  X2,  •  ■  •  ,  Xn)  —  9(2'1)  x2i  •  ■  •  >  2-ra) 

is  valid. 

Let  -i  be  an  arbitrary  binary  relation.  We  shall  say  that,  over  a  given  theory,  is  reflexive  if 
x  -4  x 

is  valid  (in  the  theory);  -i  is  irrefiexive  if 
not  (x  -<  x) 
is  valid;  -i  is  total  if 


x  -4  y  or  x  =  y  or  y  -i  x 
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is  valid;  -4  is  transitive  if 

if  (x  -4  y  and  y  -4  z)  then  x  -4  z 
is  valid;  and  -4  is  symmetric  if 
if  x  -4  y  then  y  -4  x 

is  valid. 

We  regard  logical  connectives  as  relations  on  the  set  of  truth  values  {T,  f}.  For  instance,  the  implication 
connective  ( ifP  then  Q.)  is  the  relation  that  holds  if  P  has  value  F  or  if  P  and  Q  both  have  value  T;  we  may 
read  it  as  “P  is  falser  than  (or  as  false  as)  Q.”  The  equivalence  connective  P  =  Q  is  simply  the  equality 
relation  on  {T,  F}.  Note  that,  viewed  as  binary  relations,  the  implication  connective  if-then  is  reflexive,  total, 
and  transitive,  and  the  equivalence  connective  =  is  reflexive,  transitive,  and  symmetric. 


ASSOCIATED  RELATIONS 

For  each  binary  relation,  we  shall  be  concerned  with  certain  associated  relations. 

Consider  an  arbitrary  binary  relation  x  -4  y  (read  as  ux  is  related  to  y”).  The  reflexive  closure  ^4  of  -4 
is  defined  by 

x  d  y  =  (x  -4  y  or  x  =  y). 

The  irreflexive  restriction  -<  of  ~4  is  defined  by 
x  -<  y  =  (x  -4  y  and  not  {x  =  y)) . 

The  inverse  »-  of  -4  is  defined  by 
x>-  y  =  y  *-  x. 

The  negation  yk  of  -4  is  defined  by 
x  yk  y  =  not(x-4  y). 

We  use  >-  and  >z  to  denote  the  inverses  of  -<  and  respectively,  and  yk  and  yk  to  denote  their  negations.  If 
we  are  using  the  prefix  notation  p(x,y)  for  a  binary  relation,  we  denote  its  reflexive  closure  by  p(x,  y),  its 

irreflexive  restriction  by  p(x,  y),  and  its  negation  by  j((x,  y). 

The  following  proposition  connects  the  relations  associated  with  a  given  binary  relation: 

Proposition  (negation  of  associated  relations) 

Consider  an  arbitrary  binary  relation  -4. 

The  negation  yf  of  the  reflexive  closure  of  -4  is  equivalent  to  the  irreflexive  restriction  of  its  negation 
yk,  that  is, 

i  ^  y  if  and  only  if  (i^y  and  not  (x  =  y )). 

The  negation  yk  of  the  irreflexive  restriction  of  -<  is  equivalent  to  the  reflexive  closure  of  its  negation 
yk,  that  is, 

if  and  only  if  (x  y k  y  or  x  =  y) . 
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We  are  now  ready  to  define  our  key  notion,  the  polarity  of  a  subexpression  with  respect  to  a  given  binary 
relation.  We  actually  define  the  polarity  of  a  subexpression  with  respect  to  two  binary  relations,  and 
-*.2-  This  notion  is  to  be  defined  so  that,  if  the  subexpression  is  positive,  replacing  that  subexpression  with  a 
larger  expression  (with  respect  to  -<i)  will  make  the  entire  expression  larger  (with  respect  to  -<2)-  Similarly, 
if  the  subexpression  is  negative,  replacing  that  subexpression  with  a  smaller  expression  (with  respect  to  -<i) 
will  make  the  entire  expression  larger  (with  respect  to  -*2). 

We  begin  by  defining  polarity  for  the  arguments  of  an  operator  (i.e.,  function  or  relation). 

Definition  (polarity  of  an  operator) 

Let  /  be  an  n-ary  operator  and  -<1  and  -< 2  be  binary  relations.  Then 

•  /  is  positive  over  its  ith  argument  with  respect  to  -<1  and  -<2  if  the  sentence 

if  x-t  1  y 

then  }{z\ , ...,  i,  x,  Zi+i, ...,  zn)  f(zi,  1,  y,  z»+i>  •  •■>£«) 

is  valid.  In  other  words,  replacing  x  with  a  larger  element  y  makes 

f(z  1,  •••!  zi  —  1)  X,  2i+l)  ...)  Zn) 
larger. 

•  /  is  negative  over  its  ith  argument  with  respect  to  and  -<2  if  the  sentence 

if  x-i  1  y 

then  /(*i,.  ..,zi-i,y,zi+i,...,zn)  -*2  /(zi, *j-i,  x,  zi+u ...,  z„) 
is  valid.  In  other  words,  replacing  y  with  a  smaller  element  x  makes 

f{zii  •••> Zi— 1,  y,  1, ...,  zn) 
larger. 

We  illustrate  this  notion  with  two  examples. 

Example 

Suppose  our  theory  includes  the  finite  sets  and  the  nonnegative  integers.  Take  f(z)  to  be  the  cardinality 
function  card(z),  which  maps  each  set  into  the  number  of  elements  it  contains.  Take  -<1  to  be  the  subset 
relation  C  over  the  finite  sets  and  -<2  to  be  the  weak  less-than  relation  <  over  the  nonnegative  integers. 

Then  the  card  function  is  positive  over  its  first  (and  only)  argument  with  respect  to  the  relations  C  and 
<,  because  the  sentence 

if  x  C  y 

then  card(x)  <  card{y ) 
is  valid  (in  the  theory). 


Example 

Consider  the  theory  of  the  integers.  Take  f[zi,Z2)  to  be  the  less-than  relation  zi  <  Z2-  Take  x  -<1  y  to 
be  the  predecessor  relation  x  <pred  y>  which  holds  if  x  ~  y  —  1,  and  take  -<2  to  be  the  if-then  connective. 
(Recall  that  we  regard  connectives  as  relations  on  the  set  of  truth  values.) 
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Then  the  less-than  relation  <  is  negative  over  its  first  argument  with  respect  to  -<prea  and  if-then , 
because  the  sentence 

if  X  ~^pred  y 

then  if  y  <  Z2  then  x  <  z2 

is  valid.  Also,  <  is  positive  over  its  second  argument  with  respect  to  -<pre<j  and  if-then,  because  the  sentence 

if  X  ~^pred  y 

then  if  zi  <  x  then  z\  <  y 

is  valid. 


It  follows  from  the  definition  that,  for  any  n-ary  operator  /  and  binary  relations  -<i  and  -<2> 

/  is  positive  over  its  ith  argument  with  respect  to  and  -<2 
if  and  only  if 

/  is  negative  over  its  ith  argument  with  respect  to  >-1  and  -<2 
if  and  only  if 

/  is  negative  over  its  ith  argument  with  respect  to  -<1  and  >-2 
if  and  only  if 

/  is  positive  over  its  ith  argument  with  respect  to  >-1  and  ^-2- 

When  we  say  that  a  relation  p(zi,  . . .  ,zn)  is  positive  or  negative  over  its  ith  argument  with  respect 
to  a  single  relation  -<1,  without  mentioning  a  second  relation  -<2,  we  shall  by  convention  take  -<2  to  be  the 
if-then  connective.  Thus  in  the  above  example  we  may  simply  say  that  <  is  negative  over  its  first  argument 
and  positive  over  its  second  argument,  with  respect  to  <prcd- 

Every  relation  is  both  positive  and  negative  over  each  of  its  arguments  with  respect  to  the  equality 
relation  =,  because  the  sentences 

if  x  =  y  if  x  =  y 

then  if  p[zi,...,x,...,zn)  and  then  if  p(z1,...,y . zn) 

then  p[zi,...,y,...,zn)  then  p(zi, x, zn) 

are  valid.  This  is  equivalent  to  the  relational-substitutivity  property  of  equality.  Also,  every  function  is  both 
positive  and  negative  over  each  of  its  arguments  with  respect  to  =  and  =,  because  the  sentences 

if  x  =  y  and  if  x  =  y 

then  f(z\ , ...,  x, ...,  zn)  —  f(zi , ...,  y, ...,  then  f(z\, ...,  y, zn)  =  /(zi, x,  ...,zn) 

are  valid.  This  is  equivalent  to  the  functional-substitutivity  property  of  equality. 

Every  connective  is  both  positive  and  negative  over  all  its  arguments  with  respect  to  =.  For  example, 
the  not  connective  is  both  positive  and  negative  over  its  argument  with  respect  to  =,  because  both  sentences 

if  x  =  y  and  if  x=y 

then  if  (not  x)  then  (not  y)  then  if  ( noty )  then  (not  x) 

are  valid. 

When  we  say  that  a  connective  is  positive  or  negative  over  its  ith  argument,  without  mentioning  any 
relations  -<1  and  -i2  at  all,  we  shall  by  convention  take  both  -*1  and  -<2  to  be  the  if-then  connective.  Polarity 
in  this  sense  is  close  to  its  ordinary  use  in  logic.  The  negation  connective  not  is  negative  in  its  first  (and 
only)  argument,  because  the  sentence 

if  if  x  then  y 

then  if  (noty)  then  (not  x) 
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is  valid.  The  conjunction  connective  and  and  the  disjunction  connective  or  axe  positive  over  both  their 
arguments.  The  implication  connective  if-then  is  negative  in  its  first  argument,  but  positive  in  its  second. 
The  equivalence  connective  =  has  no  polarity  in  either  argument.  The  conditional  connective  if-then-else 
has  no  polarity  in  its  first  argument,  but  is  positive  in  its  second  and  third  argument. 

Note  that  a  binary  relation  -4  is  transitive  if  and  only  if  it  is  negative  with  respect  to  -4  itself  over  its 
first  argument,  because  the  polarity  condition 

if  x  -4  y 

then  if  y  -4  z  then  x  -4  z 

is  equivalent  to  the  definition  of  transitivity.  Also,  -4  is  transitive  if  and  only  if  it  is  positive  with  respect  to 
-4  over  its  second  argument. 

We  are  now  ready  to  define  polarity  for  the  subexpressions  of  a  given  expression.  The  definition  is 
inductive. 

Definition  (polarity  of  a  subexpression) 

Let  -<i  and  -42  be  binary  relations.  Then 

•  An  expression  s  is  positive  in  s  itself  with  respect  to  -4\  and  -<2  if  “<i  implies  -<2- 

•  An  expression  s  is  negative  in  s  itself  with  respect  to  -<1  and  -<2  if  -<1  implies  >-2. 

Let  /  be  an  n-ary  operator  and  ei,  e2,  . . . ,  e„  be  expressions.  Consider  an  occurrence  of  s  in  one 
of  the  expressions  e* .  Then 

•  The  occurrence  of  s  is  positive  in  f(e  1^2,  ...  ,en)  with  respect  to  -<1  and  -42  if  there 
exists  a  binary  relation  -4  such  that 

the  polarity  of  the  occurrence  of  s  in  e,-  with  respect  to  -<1  and  -4 
is  the  same  as 

the  polarity  of  /  over  its  tth  argument  with  respect  to  -4  and  -<2- 

•  The  occurrence  of  s  is  negative  in  /(e i,e2,  . . .  ,  e„)  with  respect  to  -<1  and  -<2  if  there 
exists  a  binary  relation  -4  such  that 

the  polarity  of  the  occurrence  of  s  in  e<  with  respect  to  -<1  and  -4 
is  opposite  to 

the  polarity  of  /  over  its  tth  argument  with  respect  to  -4  and  -42. 

Furthermore,  if  /  has  no  polarity  over  its  tth  argument  or  if  s  has  no  polarity  in  e,-,  then  s  has 
no  polarity  in  /(e  1, 62,  . . . ,  en).  On  the  other  hand,  if  s  has  both  polarities  in  e<  and  /  has  some 
polarity  over  its  i  argument,  or  if  /  has  both  polarities  over  its  tth  argument  and  s  has  some  polarity 
in  e<,  then  s  automatically  has  both  polarities  in  /(e  1,  e2,  . . . ,  e„). 


Remark 

For  any  binary  relation  any  expression  s  is  positive  in  s  itself  with  respect  to  -4  and  -4  (because  -4 
implies  -4).  Similarly,  s  is  negative  in  s  with  respect  to  -4  and  t-. 

If  /  is  positive  over  its  tth  argument  with  respect  to  -<1  and  -<2,  then,  for  any  expressions  ei,  e2,  •  •  • ,  e„, 
the  occurrence  of  e<  is  positive  in /(ei,  . . . ,  e*,  ...,  en)  with  respect  to  -<1  and  -42-  For  take  -4  to  be  -<1.  Then 
the  polarity  of  e i  in  ei  itself  is  positive  with  respect  to  -<1  and  -<1.  Also,  /  is  positive  over  its  tth  argument 
with  respect  to  -<1  and  -42.  Because  these  two  polarities  are  the  same,  Cj  is  positive  in  f(e  1,  . . . ,  e<,  . . . ,  e„) 
with  respect  to  -<i  and  -<2- 
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Similarly,  if  /  is  negative  over  its  ith  argument,  then  e,-  is  negative  in  f(e i,  . . . ,  ,  . . . ,  en),  with  respect 

to  Hi  and  -<2.  j 


We  may  indicate  the  polarity  of  a  subexpression  s  by  annotating  it  s+,  s  ,  or  s±. 

For  example,  suppose  our  theory  includes  the  theories  of  sets  and  nonnegative  integers.  The  occurrence 
of  a  in  the  sentence 

card(s~ )  <  m 

is  negative  with  respect  to  the  subset  relation  C  and  the  if-then  connective.  For  note  that  card  is  positive 
over  its  argument  with  respect  to  C  and  <  and  that  <  is  negative  over  its  first  argument  with  respect  to  < 
and  if-then.  Therefore,  by  our  remark,  we  know  that  a  is  positive  in  card(a)  with  respect  to  C  and  <  and 
that  card(s)  is  negative  in  card(s)  <  m  with  respect  to  <  and  if-then.  By  the  definition,  taking  Hi  to  be 
C,  H  to  be  <,  and  H2  to  be  if-then,  we  conclude  that  a  is  negative  in  card[s)  <  m  with  respect  to  C  and 
if-then. 

When  we  say  that  an  occurrence  of  a  subexpression  is  positive  or  negative  in  a  sentence  with  respect 
to  a  single  relation  Hi,  without  mentioning  a  second  relation  H2,  we  shall  again  take  H2  to  be  the  if-then 
connective.  When  we  say  that  an  occurrence  of  a  subsentence  is  positive  or  negative  in  a  sentence,  without 
mentioning  any  relation  at  all,  we  shall  again  take  both  Hi  and  H2  to  be  if-then. 

It  can  be  established  from  the  definition  that,  for  expressions  s  and  t  and  binary  relations  Hi  and  H2, 

an  occurrence  of  s  is  positive  in  t  with  respect  to  Hi  and  H2 
if  and  only  if 

the  occurrence  of  s  is  negative  in  t  with  respect  to  >-1  and  H2 
if  and  only  if 

the  occurrence  of  s  is  negative  in  t  with  respect  to  Hi  and  2 
if  and  only  if 

the  occurrence  of  s  is  positive  in  t  with  respect  to  >-1  and  >~2- 
This  is  analogous  to  our  previous  result  concerning  polarity  for  the  argument  of  an  operator. 

Suppose  an  occurrence  of  a  is  positive  [or  negative]  in  t  with  respect  to  Hi  and  H2.  Then  if  Hi  is  a 
binary  relation  that  implies  Hi,  then  a  is  positive  [or  negative,  respectively]  in  t  with  respect  to  Hi  and  H2. 
Similarly,  if  H2  implies  a  binary  relation  H2,  then  a  is  positive  [or  negative,  respectively]  in  t  with  respect 
to  Hi  and  H2. 

We  can  establish  the  following  result: 

Lemma  (polarity  operator) 

Let  Hi  and  H2  be  binary  relations,  /  be  an  n-ary  operator,  and  ei,  e2,  . . . ,  en  be  expressions.  Con¬ 
sider  an  occurrence  of  s  in  one  of  the  expressions  e*  such  that  s  has  some  polarity  in  f(e  1,  e2,  • . . ,  en) 

with  respect  to  Hi  and  H2. 

Then  there  exists  a  binary  relation  H  such  that 

/  is  positive  over  its  ith  argument  with  respect  to  H  and  H2 


and 


the  polarity  of  the  occurrence  of  s  in  f(e  1,  e?,  . . . ,  e„)  with  respect  to  Hi  and  H2 
is  the  same  as 

the  polarity  of  the  occurrence  of  s  in  e,-  with  respect  to  Hi  and  H. 
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Proof 

Consider  the  case  in  which  the  occurrence  of  s  is  positive  in  f[e i,e^,  . . .  ,  en)  with  respect  to  -<i  and 
-<2-  According  to  the  definition,  this  means  that  there  exists  a  binary  relation  -i  such  that 

the  polarity  of  the  occurrence  of  s  in  e*  with  respect  to  -<i  and  -( 
is  the  same  as 

the  polarity  of  /  over  its  tth  argument  with  respect  to  and  -<2- 

If  /  is  positive  over  its  tth  argument  with  respect  to  -4  and  -<2,  then  the  occurrence  of  s  is  positive  in 
et-  with  respect  to  -<i  and  A,  and  we  can  simply  take  -4  to  be  -*■. 

On  the  other  hand,  if  /  is  negative  over  its  tth  argument  with  respect  to  A  and  -*2,  then  the  occurrence 
of  s  is  negative  in  et-  with  respect  to  -<i  and  -i.  By  previous  remarks,  this  means  that  /  is  positive  over  its 
tth  argument  with  respect  to  the  inverse  relation  y-  and  -<2)  and  the  occurrence  of  s  is  positive  in  e<  with 
respect  to  -<i  and  the  inverse  relation  £•.  Hence  we  can  take  to  be  £•. 

The  case  in  which  s  is  negative  in  f[e i,  e2>  •  •  ■ ,  en)  is  treated  similarly.  ^ 


Polarities  of  subexpressions  of  subexpressions  can  be  composed  according  to  the  following  result. 
Lemma  (polarity  composition) 

Consider  an  occurrence  of  a  subexpression  r  in  an  expression  s  and  an  occurrence  of  s  in  an 
expression  t.  Then  the  polarity  of  the  occurrence  of  r  is  positive  [or  negative]  in  t  with  respect  to 
binary  relations  -<i  and  -<2  if  and  only  if  there  exists  a  binary  relation  -4  such  that 

the  polarity  of  the  occurrence  of  r  in  s  with  respect  to  -<1  and  -< 
is  the  same  as  [or  opposite  to,  respectively] 
the  polarity  of  the  occurrence  of  s  in  t  with  respect  to  and  -<2- 


For  instance,  if  r  is  negative  in  s  and  s  is  negative  in  t  then  r  is  positive  in  t,  with  respect  to  the 
appropriate  binary  relations.  If  r  has  both  polarities  in  s  and  s  has  some  polarity  in  t,  then  r  has  both 
polarities  in  t. 

We  can  now  establish  the  fundamental  property  of  polarity. 

Lemma  (polarity  replacement) 

For  any  binary  relations  and  H 2  and  expression  e{x+,  y~),  the  sentence 
if  x  -<iy 

then  e(x+ ,  y_)  -<2  e(y+,  a;-)1 

is  valid.  Here  e(y+,  a:-)1  is  the  result  of  replacing  in  e(x+,  y~)  precisely  one  positive  occurrence 
of  x  with  y  or  negative  occurrence  of  y  with  x  (we  assume  that  such  an  occurrence  exists)  where 
the  polarity  is  taken  in  e{x+,  y~)  with  respect  to  -<1  and  -<2. 


Example 

Suppose  our  theory  includes  the  theories  of  lists  and  nonnegative  integers.  Take  -<1  to  be  the  tail 
relation  x  -<taii  y>  which  is  true  if 

not  (y  =  [  ])  and  x  =  tail{y), 
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that  is,  if  y  is  nonempty  and  x  is  the  list  of  all  but  the  first  element  of  y.  Take  -42  to  be  the  predecessor 
relation  <pTed-  Take  e(x+ ,  y~)  to  be  the  expression 

length{x+)  +  length(x+), 

where  the  function  length(x)  yields  the  number  of  elements  in  the  list  x. 

Note  that  each  occurrence  of  x  is  positive  in  length[x)  +  length(x)  with  respect  to  <taii  and  <Pred,  as 
indicated  by  the  annotations.  For,  each  occurrence  is  positive  in  length(x)  with  respect  to  -<taii  and  -<pred, 
and  the  plus  function  +  is  positive  over  either  of  its  arguments  with  respect  to  <pred  and  <pTed- 

Therefore,  according  to  the  lemma,  the  sentence 

if  x  -<taii  y 

then  length{x )  +  length{x)  <pred  length[y)  +  length{x ) 

is  valid,  because  length{y)  +  length{x )  is  the  result  of  replacing  one  positive  occurrence  of  x  in  length(x)  + 
length(x)  with  y.  Also,  according  to  the  lemma,  the  sentence 

if  x  -<tau  y 

then  length(x)  +  length[x)  -4pred  length(x)  +  length( y) 

is  valid,  because  length[x)  +  length[y)  is  the  result  of  replacing  one  positive  occurrence  of  x  in  length[x)  + 
length(x)  with  y. 

On  the  other  hand,  the  lemma  does  not  allow  us  to  conclude  that 

if  x  <taii  y 

then  length(x)  +  length(x)  <pred  length(y)  +  length[y ) 

is  valid,  because  length[y)  +  length{y)  is  obtained  by  replacing  two,  not  one,  positive  occurrences  of  a;  in 
length(x)  +  length(x)  with  y.  In  fact,  this  third  sentence  is  not  valid. 


We  now  prove  the  lemma. 

Proof  (polarity  replacement  lemma) 

For  any  arbitrary  binary  relation  -<i,  suppose  that 
x  -<1  y. 

We  show  that,  for  any  expression  e{x+,  y~),  we  have,  for  any  binary  relation  -<2, 
e{x+,  y~)  -<2  e(y+,  x-)1. 

The  proof  is  by  induction  on  the  structure  of  e(x+ ,  y~).  In  other  words,  we  show  the  desired  conclusion 
for  an  arbitrary  expression  e(x+ ,  y~),  under  the  induction  hypothesis  that,  for  any  proper  subexpression 
e'(x+,  y~)  of  e(x+,  y~),  we  have,  for  any  binary  relation  -4 2 , 

e(x+,  y~ )  32  e(y+,  x-)1. 

As  in  the  statement  of  the  lemma,  e(y+ ,  x-)1  is  obtained  from  e{x+,y~)  by  replacing  precisely  one  occurrence 
of  x  or  y,  of  suitable  polarity  with  respect  to  -<1  and  -<2- 

The  proof  distinguishes  among  several  subcases. 

Case :  The  expression  e(x+,  y~)  is  simply  x 

Then,  because  the  replaced  variable  x  is  positive  in  x,  with  respect  to  -<1  and  -42,  we  have  (by  the 
definition  of  polarity)  that  -<1  implies  -42. 
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In  this  case,  e(y+,  x  )*  is  y,  and  we  must  show 


x  h2  y- 

But  this  follows  from  our  supposition  that  x  Hi  y,  because  Hi  implies  H2. 

Case:  The  expression  e(x+,  y~)  is  simply  y 

Then,  because  the  replaced  variable  y  is  negative  in  y  with  respect  to  Hi  and  h2)  we  have  (by  the 
definition  of  polarity)  that  Hi  implies  >-2. 

In  this  case,  e(y+ ,  x~)  is  x,  and  we  must  show  that 

V  “<2  x, 

or,  equivalently,  that 

x  *-2  y- 

But  this  follows  from  our  supposition  that  x  Hi  y,  because  Hi  implies  >-2. 

Case:  e(x+ ,  y-)  is  of  form  f[e  1, 62,  . . . ,  en),  where  /  is  an  n-ary  operator 

The  replaced  occurrence  of  x  [or  y]  must  occur  in  one  of  the  arguments  e<  of  /.  Because  this  occurrence 
is  positive  [or  negative,  respectively]  in  f(e  1,  e2,  . . . ,  en)  with  respect  to  Hi  and  H2,  we  know  (by  the  polarity 
operator  lemma)  that  there  exists  a  binary  relation  H  such  that 

/  is  positive  over  its  tth  argument  with  resect  to  H  and  h2 

and 


the  polarity  of  the  replaced  occurrence  of  x  [or  y]  in  with  respect  to  Hi  and  H 
is  the  same  as 

the  polarity  of  the  replaced  occurrence  of  x  [or  y]  in  f[e  1,  e2,  . . . ,  e„),  that  is, 
e{x+,  y~),  with  respect  to  Hi  and  H2. 

Let  us  therefore  write  e,-  as  e<(a:+,  y“). 

Because  ei(x+ ,  y~)  is  a  proper  subexpression  of  e(x+,  y~),  we  can  apply  our  induction  hypothesis, 
taking  e'(x+,  y~ )  to  be  e,(x+,  y~)  and  H2  to  be  H,  to  conclude  that 

ei(x+,  y~)  H  ei (y+ ,  x~)1. 

Therefore  (by  the  definition  of  polarity  of  an  operator,  because  /  is  positive  over  its  tth  argument  with 
respect  to  H  and  H2),  we  have 

/(ei,...,e,(x+,  y~),...,en)  H2  /(ei,...,ei(y+,  x~), ...,  e„), 

that  is, 

e{x+,  y~)  -<2  e(y+,  x~)x, 
as  we  wanted  to  show.  This  completes  the  proof. 


The  polarity  replacement  lemma  allows  us  to  replace  precisely  one  occurrence  of  a  variable.  If  we  know 
more  about  the  relation  H2,  we  can  establish  stronger  versions  of  the  lemma.  In  particular,  if  we  know  that 
H2  is  transitive,  we  can  replace  one  or  more  occurrences  of  the  variable. 

Lemma  (transitive  polarity  replacement) 
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For  any  binary  relations  -<i  and  <2  and  expression  e(x+,  y  ),  where  -<2  is  transitive,  the  sentence 
if  x<xy 

then  e{x+,  y~)  -<2  e(y+,  x~)n 

is  valid  for  every  positive  integer  n.  Here  e(y+ ,  x~)n  is  the  result  of  replacing  in  e{x+ ,  y~)  precisely 
n  positive  occurrences  of  x  with  y  or  negative  occurrences  of  y  with  x,  where  the  polarity  is  taken 
in  e(x+ ,  y~)  with  respect  to  -<1  and  -<2-  j 

Note  that  we  can  replace  occurrences  of  both  x  and  y  in  the  same  expression;  precisely  n  replacements 
are  made  altogether.  Also,  the  lemma  requires  that  at  least  one  replacement  be  made. 

Example 

Suppose  our  theory  includes  the  theories  of  both  lists  and  integers.  Take  e(x+,  y~)  to  be  the  expression 

e(x+,  y~)  :  length(x+)  +  ( length[x+ )  —  length[y~)). 

Take  -<1  to  be  the  tail  relation  -<taii  (defined  in  a  previous  example)  and  -<2  to  be  the  less-than  relation 
<.  Note  that,  with  respect  to  and  <,  both  occurrences  of  x  are  positive  and  the  occurrence  of  y  is 
negative  in  e(z+,  y-);  also  <  is  transitive.  According  to  the  lemma,  the  following  sentences  (among  others) 
are  valid:  the  sentence 

if  x  -<taii  y 

then  length(x)  +  (length(x)  —  length(y))  <  length(y)  +  ( length[y )  —  length{y)), 
for  which  both  occurrences  of  x  in  e(x+ ,  y~)  have  been  replaced,  and 

if  x  -<ta.ii  y 

then  length[x )  +  [length{ x)  —  length(y))  <  length(x)  +  ( length[y )  —  length(x)), 
for  which  one  occurrence  of  x  and  one  of  y  in  e(x+ ,  y-)  have  been  replaced. 

On  the  other  hand,  the  lemma  does  not  allow  us  to  conclude  that 
if  x  <tau  y 

then  length[x)  +  (length(x)  —  length{y))  <  lengtk(x)  +  ( length[x )  —  length(y)), 

is  valid,  because  no  replacements  of  x  or  of  y  in  e{x+,  y~)  have  been  replaced.  In  fact,  this  final  sentence  is 
clearly  not  valid. 

We  now  prove  the  lemma 
Proof  (transitive  polarity  replacement  lemma) 

We  assume  throughout  that  polarity  is  with  respect  to  -«i  and  -<2-  We  suppose  that 
x-<iy 

and  show  that 

e{x+,  y~)  < 2  e(y+,  x~)n, 

for  every  positive  integer  n.  The  proof  is  by  induction  on  n. 

Base  Case:  n  =  1. 

In  this  case,  precisely  one  replacement  is  made.  The  desired  result 
e(x+,  y~)  -<2  e(y+,  x-)1 

follows  from  the  original  polarity  replacement  lemma. 
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Inductive  Step: 

For  an  arbitrary  positive  integer  k,  we  assume  inductively  that 
e(x+,  y~)  -<2  e(y+,  x~)k 
and  show  that 

e(x+,  y~)  -*2  e(y+,  x~)k+1. 

Observe  that  e(y+,  x~)k+1  can  be  obtained  from  e(y+,  x~)k  by  replacing  precisely  one  positive  occur¬ 
rence  of  x  with  y  or  one  negative  occurrence  of  y  with  x.  Therefore,  by  the  original  polarity  replacement 
lemma,  we  have 

e(y+,  x~)k  -<2  e(y+,  x~)k+1. 

Because  our  induction  hypothesis  is  that  e(x+ ,  y~)  -<2  e(y+,  x~)k,  and  because  we  have  assumed  that 
-<2  is  transitive,  we  can  conclude  that 

e(z+,  y~)  -<2  e(y+,  x~)k+1, 

as  we  wanted  to  show. 


If  ~<2  is  transitive,  the  above  lemma  allows  us  to  replace  one  or  more  occurrences  of  a  variable.  If  -<2  is 
both  reflexive  and  transitive,  the  following  lemma  allows  us  to  replace  zero,  one,  or  more  occurrences. 

Lemma  (reflexive  transitive  polarity  replacement) 

For  any  binary  relations  -<1  and  -<2  and  expression  e(x+,  y~),  where  -<2  is  both  reflexive  and  transitive, 
the  sentence 

if  x-i  1  y 

then  e(x+,  y~)  -<2  e{y+ ,  x~) 

is  valid.  Here  e(y+,  x~)  is  the  result  of  replacing  in  e{x+ ,  y~)  certain  positive  occurrences  of  x  with  y  and 
certain  negative  occurrences  of  y  with  x,  where  polarity  is  taken  in  e(x+ ,  y~)  with  respect  to  -<1  and  -<2- 

J 


This  lemma,  as  opposed  to  the  transitive  polarity  replacement  lemma,  admits  the  possibility  of  replacing  no 
occurrences  at  all  of  x  or  y  in  e{x+,  y~). 

Example 

Suppose  our  theory  includes  the  theories  of  both  finite  sets  and  integers.  Take  e(x+,  y~)  to  be  the 
expression 

e(x+,  y~)  :  card(x+  ~  y~)  —  card(y~  ~  z+) 

where  x  ~  y  is  the  difference  between  the  sets  x  and  y,  that  is,  the  set  of  elements  of  x  that  do  not  belong  to 
y.  Take  -<1  to  be  the  subset  relation  C  and  -<2  to  be  the  weak  less-than  relation  <.  Note  that,  with  respect 
to  C  and  <,  both  occurrences  of  x  are  positive  and  both  occurrences  of  y  are  negative  in  e(x+,  y_),  as  the 
annotations  indicate.  Also,  <  is  both  transitive  and  reflexive. 

Therefore,  according  to  the  lemma,  the  following  sentences  are  valid:  the  sentence 

if  x  C  y 

then  card[x  ~  y)  —  card[y  ~  x)  <  card[y  ~  x)  —  card(x  ~  y), 
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for  which  all  occurrences  of  x  and  y  in  e(x+ ,  y  )  have  been  replaced,  and  the  sentence 
if  xQy 

then  card(x  ~  y)  —  card(y  ~  x)  <  card[x  ~  y)  —  card[y  ~  s), 

for  which  no  occurrences  of  x  and  y  in  e(x+ ,  y~)  have  been  replaced.  Of  course,  other  valid  sentences  can 
be  obtained  by  replacing  some,  but  not  all,  of  the  occurrences  of  x  and  y  in  e(x+ ,  y~). 


.  The  proof  is  straightforward. 

Proof  (reflexive  transitive  polarity-replacement  lemma) 

In  the  case  in  which  no  replacements  are  made,  e(y+ ,  x~)  is  identical  to  e(x+,  y~),  and  the  desired 
result  holds  because  we  have  supposed  that  -<2  is  reflexive.  In  the  case  in  which  one  or  more  replacements  are 
made,  the  desired  result  follows  from  the  transitive  polarity  replacement  lemma,  because  we  have  supposed 
that  -<2  is  also  transitive. 

The  following  consequence  of  the  polarity  replacement  lemma  will  be  used  most  frequently: 
Proposition  (polarity  replacement) 

For  any  binary  relation  -4  and  sentence  P(x+,  y-),  the  sentence 
if  x  -4.  y 

then  if  P {x+ ,  y~) 

then  P(y+,  x~) 

is  valid.  Here  P  (y+ ,  x~)  is  the  result  of  replacing  in  P(x+,  y~)  certain  positive  occurrences  of  x 
with  y  and  certain  negative  occurrences  of  y  with  x,  where  polarity  is  taken  in  P{x+,  y~)  with 
respect  to  -4. 

Recall  that,  when  we  refer  to  polarity  in  a  sentence  with  respect  to  a  single  relation  we  mean  polarity 
with  respect  to  -4  and  the  if-then  connective.  The  proposition  allows  us  to  replace  occurrences  of  both  x 
and  y  in  the  same  sentence  and  (trivially)  admits  the  possibility  that  no  replacements  are  made. 

The  proof  is  immediate. 

Proof 

Regarded  as  a  relation,  the  if-then  connective  is  reflexive  and  transitive.  The  replaced  occurrences  of 
x  and  y  are  respectively  positive  and  negative  in  P(x+,  y~)  with  respect  to  -4  and  if-then.  Therefore  the 
proposition  is  simply  an  instance  of  the  reflexive  transitive  polarity  replacement  lemma,  taking  -<1  to  be  -4, 
-<2  to  be  if-then ,  and  e{x+ ,  y~)  to  be  P(x+,  y~). 


Example 

Suppose  our  theory  includes  the  theories  of  finite  sets  and  integers.  Take  P  (x+ ,  y~ )  to  be  the  sentence 

P(x+ ,  y~)  :  a  <  card(x+  ~  y~ )  and  card(y~  ~  z+)  <  b. 

Take  -4  to  be  the  subset  relation  C.  Note  that,  with  respect  to  C,  both  occurrences  of  x  are  positive  and 
both  occurrences  of  y  are  negative  in  P(x+,  y~),  as  indicated  by  the  annotations.  Therefore,  according  to 
the  proposition,  the  following  sentences  are  valid:  the  sentence 

if  xCy 

then  if  a  <  card(x~  y)  and  card(y  ~  x)  <  b 

then  a<card{x~x)  and  card(y~y)<b, 
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for  which  one  occurrence  of  x  and  one  occurrence  of  y  in  P  (x+ ,  y  )  has  been  replaced,  the  sentence 
t/lCy 

then  if  a  <  card(x  ~  y)  and  card(y  ~  x)  <  b 

then  a  <  card(y  ~  y)  and  card[y~y)<b, 

for  which  both  occurrences  of  x  in  P  (x+ ,  y~ )  have  been  replaced,  and  the  sentence 

if  x  C  y 

then  if  a  <  card[x~  y)  and  card(y  ~  x)  <  b 

then  a  <  card(y~  x)  and  card(x  ~  y)  <  b, 

for  which  both  occurrences  of  x  and  both  occurrences  of  y  in  P  (z+ ,  y~ )  have  been  replaced. 


We  have  now  developed  the  mathematical  results  on  relational  polarity  we  need  in  order  to  introduce 
the  special-relations  rules.  But  first,  we  introduce  briskly  our  basic  nonclausal  deduction  system. 


4.  NONCLAUSAL  DEDUCTION 


In  this  section  we  present  a  basic  nonclausal  deduction  system,  without  any  special-relations  rules.  This 
system  bears  some  resemblance  to  those  of  Murray  [82]  and  Stickel  [82];  it  is  based  on  the  system  of  Manna 
and  Waldinger  [80],  but  is  simplified  in  several  respects: 

•  The  system  presented  here  is  a  refutation  system;  it  attempts  to  show  that  a  given  set  of  sentences 
is  unsatisfiable.  (The  original  system  operates  on  a  tableau  of  assertions  and  goals,  and  attempts 
to  show  that  at  least  one  of  the  goals  follows  from  the  assertions.) 

•  The  system  is  presented  with  no  program  synthesis  capabilities. 

•  The  mathematical  induction  principle  is  omitted. 

These  simplifications  have  been  made  for  purely  expository  purposes:  the  special-relations  rules  are 
compatible  with  a  tableau  theorem  prover  and  with  the  induction  principle  and  are  of  great  use  in  program 
synthesis,  our  primary  application. 


THE  DEDUCED  SET 

The  deduction  system  we  describe  operates  on  a  set,  called  the  deduced  set,  of  sentences  in  quantifier- 
free  first-order  logic.  We  attempt  to  show  that  a  given  deduced  set  is  unsatisfiable,  i.e.,  that  there  is  no 
interpretation  under  which  all  the  sentences  are  true. 

Theorem  proving  in  a  first-order  axiomatic  theory  can  be  reduced  to  showing  the  unsatisfiability  of  such 
a  set.  In  particular,  to  show  that  a  sentence  7  is  valid  in  a  theory  whose  axioms  are  A i,  A2,  •  •  • ,  Ak,  we 
can 

•  Remove  the  quantifiers  of  the  sentences  Ai,  A2,  ■  ■  ■ ,  Ak,  and  not  7,  by  skolemization  (see, 
for  example,  Chang  and  Lee  [73],  Loveland  [78],  or  Robinson  [79]). 

•  Show  the  unsatisfiability  of  the  resulting  set  of  quantifier-free  sentences. 

We  do  not  require  that  the  sentences  be  in  clausal  form;  indeed,  they  can  use  the  full  set  of  connectives  of 
propositional  logic,  including  equivalence  (=)  and  the  conditional  ( if-then-else ). 
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Example 

Consider  the  theory  of  the  strict  partial  ordering  -<,  defined  by  the  transitivity  axiom 


(Vx)  (Vy)  (Vz) 


if  x  <  y  and  y  <  z 
then  x  <  z 


and  the  irreflexivity  axiom 

(Vx)  [not  (x  -<  x)] . 

Suppose  we  would  like  to  show  that  in  this  theory  the  asymmetry  property 


(Vu)  (Vu) 


if  u  <  v 
then  not  v  -<  u 


is  valid.  It  suffices  to  show  that  the  set  of  quantifier-free  sentences 

not  [x  -<  x)  not 

is  unsatisfiable. 


if  x  -<  y  and  y  z 
then  x  <  z 


if  a  <b 
then  not  (6  -<  o)J 


If  the  truth  symbol  false  belongs  to  the  deduced  set,  the  set  is  automatically  unsatisfiable,  because  the 
sentence  false  is  not  true  under  any  interpretation. 

Because  the  variables  of  the  sentences  in  the  deduced  set  are  tacitly  quantified  universally,  we  can 
systematically  rename  them  without  changing  the  unsatisfiability  of  the  set;  that  is,  the  set  is  unsatisfiable 
before  the  renaming  if  and  only  if  it  is  unsatisfiable  afterwards.  Of  course,  we  must  replace  every  occurrence 
of  a  variable  in  the  sentence  with  the  new  variable,  and  we  must  be  careful  not  to  replace  distinct  variables 
in  the  sentence  with  the  same  variable.  The  variables  of  the  sentences  in  the  deduced  set  may  therefore  be 
standardized  apart ;  in  other  words,  we  may  rename  the  variables  of  the  sentences  so  that  no  two  of  them 
have  variables  in  common. 

For  any  sentence  7  in  the  deduced  set  and  any  substitution  9,  we  may  add  to  the  set  the  instance  76 
of  7,  without  changing  the  unsatisfiability  of  the  set.  In  particular,  if  the  deduced  set  is  unsatisfiable  after 
the  addition  of  the  new  sentence,  it  was  also  unsatisfiable  before.  Note  that  in  adding  the  new  sentence  79, 
we  do  not  remove  the  original  sentence  7. 


THE  DEDUCTIVE  PROCESS 

In  the  deductive  system  we  apply  deduction  rules,  which  add  new  sentences  to  the  deduced  set  without 
changing  its  unsatisfiability.  Deduction  rules  are  expressed  as  follows: 

7l,  7l,  ...,  7m 

7 

This  means  that,  if  the  given  sentences  7\,  7i,  . . . ,  7m  belong  to  the  deduced  set,  the  conclusion  7  may 
be  added.  Such  a  rule  is  said  to  be  sound  if  the  given  sentences  7\,  7z,  •  ■  • ,  7m  imply  the  sentence  /.  If  a 
deductive  rule  is  sound,  its  application  will  preserve  the  unsatisfiability  of  the  deduced  set. 

The  deductive  process  terminates  successfully  if  we  introduce  the  truth  symbol  false  into  the  deduced 
set.  Because  deduction  rules  preserve  unsatisfiability,  and  because  a  set  of  sentences  containing  false  is 
automatically  unsatisfiable,  this  will  imply  that  the  original  deduced  set  was  also  unsatisfiable. 

We  include  two  classes  of  deduction  rules  in  the  basic  system: 
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•  The  transformation  rules,  which  replace  subsentences  with  equivalent  sentences. 

•  The  resolution  rule,  which  performs  a  case  analysis  on  the  truth  of  matching  subsentences. 

These  rules  are  described  in  this  section.  In  later  sections,  we  augment  the  basic  system  with  two  new  classes 
of  rules: 

•  The  replacement  rules,  which  replace  subexpressions  with  other  expressions  (not  necessar¬ 
ily  equivalent  or  equal). 

•  The  matching  rules,  which  introduce  new  conditions  to  be  proved  that  enable  subexpres¬ 
sions  to  be  matched. 

We  first  describe  the  transformation  rules. 


TRANSFORMATION  RULES 

The  transformation  rules  replace  subsentences  of  the  sentences  of  our  deduced  set  with  propositionally 
equivalent,  simpler  sentences.  For  instance,  the  transformation  rule 

P  and  true  — ►  P 

replaces  a  subsentence  of  form  (P  and  true)  with  the  corresponding  sentence  of  form  P.  The  simplified 
sentence  is  then  added  to  the  deduced  set.  (Logically  speaking,  the  original  sentence  remains  in  the  deduced 
set  too,  but,  for  efficiency  of  implementation,  the  original  sentence  need  not  be  retained.) 

We  include  a  full  set  of  such  true-false  transformation  rules;  e.g., 

not  true  —*  false 

P  or  true  —*  true 

if  P  then  false  — »  not  P . 

These  rules  can  eliminate  from  a  sentence  any  occurrence  of  the  truth  symbols  true  and  false  as  a  proper 
subsentence. 

We  also  include  such  propositional  simplification  rules  as 
P  and  P  — +  P 
not  not  P  —*  P. 

These  rules  are  not  logically  necessary,  but  are  included  for  cosmetic  purposes. 

The  soundness  of  the  transformation  rules  is  evident,  because  each  produces  a  sentence  equivalent  to 
the  one  to  which  it  is  applied. 

Example 

Suppose  our  deduced  set  contains  the  sentence 

if  q(a )  then  false 
7 :  or 

(not  true)  or  (notq(a)). 

(We  omit  parentheses  when  the  structure  of  the  sentence  can  be  indicated  by  indenting.)  This  can  be 
transformed,  by  application  of  the  rule 

if  P  then  false  —* 


not  P 
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into  the  sentence 

not  q(a) 
or 

(not  true)  or  (not  9(a)), 
which  may  then  be  added  to  the  deduced  set. 

The  new  sentence  can  be  transformed  in  turn,  by  successive  application  of  the  rules 
not  true  — ►  false 
false  or  P  — < ►  P, 

P  or  P  — »  P, 
into  the  sentence 

not  q(a). 

We  shall  say  that  the  original  sentence  7  reduces  to  (not  q(a))  under  transformation. 

Our  original  system  (Manna  and  Waldinger  [80])  included  many  more  transformation  rules;  also,  then- 
operation  was  more  complex.  In  this  system,  the  role  of  these  more  complex  rules  has  been  assumed  by  the 
replacement  rule  of  Section  5. 


RESOLUTION  RULE:  GROUND  VERSION 

The  resolution  rule  applies  to  two  sentences  of  our  set,  and  performs  a  case  analysis  on  the  truth  of  a 
common  subsentence.  Instances  of  the  sentences  can  be  formed,  if  necessary,  to  create  a  common  subsentence; 
however,  we  first  present  the  ground  version  of  the  rule,  which  does  not  form  instances  of  these  sentences. 

Rule  (resolution,  ground  version) 

For  any  ground  sentences  P ,  7\P\ ,  and  Q[P\,  we  have 

m 

SIP] 

7[false ]  or  §[true] 

In  other  words,  if  7[P\  and  Q[P\  are  sentences  in  our  deduced  set  with  a  common  subsentence  P ,  we  can 
add  to  the  set  the  sentence  ( 7[false]  or  Q[true\)  obtained  by  replacing  every  occurrence  of  P  in  7\P]  with 
false,  replacing  every  occurrence  of  P  in  Q[P\  with  true,  and  taking  the  disjunction  of  the  results.  We  shall 
assume  that  7\P\  and  Q[P)  have  at  least  one  occurrence  each  of  the  subsentence  P .  We  do  not  require  that 
7\P\  and  @[P]  be  distinct  sentences. 

Because  the  resolution  rule  introduces  new  occurrences  of  the  truth  symbols  true  and  false,  it  is  always 
possible  to  simplify  the  resulting  sentence  immediately  afterwards  by  application  of  the  appropriate  true-false 
rules.  These  subsequent  transformations  will  sometimes  be  regarded  as  part  of  the  resolution  rule  itself. 


4.  Nonclausal  Deduction 


27 


Example 

Suppose  our  deduced  set  contains  the  sentences 


7  :  if  9(a)  then  p(a,  b) 


and 


§:  ( not  p(a,  b)  )  or  ( not  q(a )). 


These  sentences  have  a  common  subsentence  p[a,  6),  indicated  by  the  surrounding  boxes.  By  application  of 
the  resolution  rule,  we  may  replace  every  occurrence  of  p(o,  b)  in  7  with  false,  replace  every  occurrence  of 
p(a,  b)  in  §  with  true,  and  take  the  disjunction  of  the  result,  obtaining  the  sentence 


if  q(a )  then  false 
or 

(not  true)  or  ( not  q(a )), 

which  (as  we  have  seen  in  a  previous  example)  reduces  under  transformation  to 
not  q(a). 


This  sentence  may  be  added  to  the  deduced  set. 


Let  us  show  that  the  resolution  rule  is  sound,  and  hence  that  it  preserves  the  unsatisfiability  of  the 
deduced  set. 

Justification  (resolution  rule,  ground  version) 

We  must  show  that  the  given  sentences  7\P\  and  Q\P\  imply  the  newly  deduced  sentence  (7\false\  or 
£J[<rue]) .  Suppose  that  7\P\  and  Q\P\  are  true;  we  would  like  to  show  that  then  (7[false\  or  Q[true\)  is  true. 
We  show  that  one  of  the  two  disjuncts,  7\}alse ]  or  Q\true\,  is  true. 

In  the  case  in  which  the  common  subsentence  P  is  false,  we  know  (by  the  value  property,  because  P 
and  false  have  the  same  truth  value  and  7\P]  is  true)  that  the  first  of  the  disjuncts,  7 [false],  is  true. 

Similarly,  in  the  case  in  which  the  common  subsentence  P  is  true,  we  know  (by  the  value  property  again, 
because  P  and  true  have  the  same  truth  value  and  Q[P\  is  true)  that  the  second  of  the  disjuncts,  Q[true],  is 
true. 


We  have  established  the  soundness  of  the  ground  version  of  the  resolution  rule  when  applied  to  ground 
sentences,  which  contain  no  variables.  We  require  the  sentences  to  be  ground  because  the  justification 
depends  on  the  value  property,  which  holds  only  for  ground  sentences.  We  can  actually  apply  the  ground 
version  of  the  rule  to  sentences  with  variables;  the  soundnes  of  such  applications  follows  from  the  justification 
for  the  general  version  of  the  rule,  which  we  present  later. 

We  now  discuss  an  important  strategy  for  controlling  the  resolution  rule. 


THE  POLARITY  STRATEGY 

Murray’s  [82]  polarity  strategy  allows  us  to  consider  only  those  applications  of  the  resolution  rule  under 
which  at  least  one  occurrence  of  P  is  positive  (or  of  no  polarity)  in  7[P]  and  at  least  one  occurrence  of  P  is 
negative  (or  of  no  polarity)  in  $\P\.  In  other  words,  not  all  the  subsentences  that  are  replaced  with  false  are 
negative  and  not  all  the  subsentences  that  are  replaced  with  true  are  positive.  This  strategy  blocks  many 
useless  applications  of  the  rule  and  rarely  interferes  with  a  reasonable  step. 
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The  intuitive  rationale  for  the  polarity  strategy  is  that  it  is  our  goal  to  deduce  the  sentence  false,  which 
is  more  false  than  any  other  sentence.  By  replacing  positive  sentences  with  false  and  negative  sentences  with 
true,  we  are  moving  in  the  right  direction,  making  the  entire  sentence  more  false. 


Example 


Suppose  our  deduced  set  contains  the  sentences 


7 :  p(a)  or  q{b) 


and 


$  :  if  p(a)  then  q{b). 


These  sentences  have  occurrences  of  a  common  subsentence  p(a),  of  positive  and  negative  polarity,  respec¬ 
tively,  as  indicated  by  the  annotation.  By  application  of  the  resolution  rule,  we  obtain  the  sentence 


false  or  q(b) 
or 

if  true  then  q{b), 

which  reduces  to  q(b)  under  transformation. 


Let  us  reverse  the  roles  of  our  sentences. 


7: 

if  p(a) 

5 :  1 

p(a) 

then  g(fc) 
or  q(b). 


The  sentences  still  have  occurrences  of  a  common  subsentence  p(o).  However,  it  is  in  violation  of  the  polarity 
strategy  to  apply  the  rule  for  the  sentences  in  this  order,  because  now  the  occurrence  of  p(a)  is  negative  in 
7,  i.e.,  it  is  not  positive  or  of  no  polarity.  Also,  the  polarity  of  p(o)  is  positive  in  Q.  If  we  insist  on  applying 
the  resolution  rule  anyway,  we  obtain  the  sentence 


if  false  then  q[b) 
or 

true  or  q(b), 

which  reduces  to  true  under  transformation.  Although  it  does  no  harm  to  add  the  sentence  true  to  our 
deduced  set,  it  is  of  no  use  in  establishing  the  unsatisfiability  of  the  set. 


There  are  two  other  legal  applications  of  the  resolution  rule  to  the  same  two  sentences,  obtained  by 
taking  the  common  subsentence  to  be  q(b)  rather  than  p(a).  Both  of  these  applications  of  the  rule  lead  us 
to  obtain  the  redundant  sentence  true,  and  both  are  in  violation  of  the  polarity  strategy. 


RESOLUTION  RULE:  GENERAL  VERSION 


The  general  version  of  the  rule  allows  us  to  instantiate  the  variables  of  the  given  sentences  as  necessary 
to  create  common  subsentences.  It  is  expressed  as  follows: 


4.  Nonclausal  Deduction 
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Rule  (resolution,  general  version) 

For  any  sentences  P,  P,  7[P\,  and  Q\P\,  where  7  and  Q  are  standardized  apart,  i.e.,  they  have  no 
variables  in  common,  we  have 

m 

SIP\ 

76\false]  or  Q8\true] 

where  8  is  a  most-general  unifier  of  P  and  P. 

More  precisely, 

•  7  has  one  or  more  subsentences  P ,  Pi,  P2,  .... 

A/  /V  /V 

•  Q  has  one  or  more  subsentences  P>  P2,  •  •  •  • 

•  9  is  a  most  general  unifier  of  P ,  Pi,  P2,  ■  •  ■ ,  and  P,  Pi,  P2,  ... ;  hence 

pe  =  P18  =  P28  =  ...  =  ?6  =  Pi  9  =  P20  =  .... 

•  The  conclusion  of  the  rule  is  obtained  by  replacing  all  occurrences  of  P8  in  79  with  false, 
replacing  all  occurrences  of  P8  (that  is,  P8)  in  Q8  with  true,  and  taking  the  disjunction 
of  the  results. 

In  other  words,  we  apply  the  ground  version  of  the  rule  to  78  and  Q9,  taking  P9  as  the  common 
subsentence.  . 


The  rule  requires  that  the  sentences  7  and  Q  be  standardized  apart,  i.e.,  that  they  have  no  variables 
in  common.  This  may  be  achieved  by  renaming  the  variables  of  the  sentences  as  necessary.  If  both  are  the 
same  sentence,  we  rename  the  variables  of  one  copy  of  the  sentence. 

Let  us  show  that  the  general  version  of  the  rule  is  sound. 

Justification  (resolution  rule,  general  version): 

The  soundness  of  the  general  version  of  the  rule  follows  from  the  soundness  of  its  ground  version.  We 
show  that  the  sentences  7  and  $  imply  the  sentence  (76\false]  or  Q9[true\). 

We  suppose  that  [under  a  given  interpretation]  the  sentences  7  and  Q  are  true  and  show  that  (78\false\ 
or  ge[true\)  is  also  true.  It  suffices  (by  the  definition  of  truth  for  a  nonground  sentence)  to  show  that  any 
ground  instance  of  ( 78\false\  or  £0[true])  is  true. 

Because  7  and  Q  are  true,  we  know  (by  the  instantiation  lemma)  that  79  and  Q8  are  true  and  hence 
(by  the  definition  of  truth  for  a  nonground  sentence)  that  every  ground  instance  of  78  and  Q8  is  true.  But 
any  ground  instance  of  (78\false]  or  g8[true\)  is  the  result  of  applying  the  ground  version  of  the  rule  to  the 
corresponding  ground  instance  of  78  and  $8',  therefore  it  is  also  true. 


The  general  version  of  the  rule  includes  the  ground  version  as  a  special  case,  in  which  the  most-general 
unifier  8  is  the  empty  substitution  {  }. 

The  following  illustration  of  the  general  resolution  rule  is  extracted  from  the  derivation  of  a  binary-search 
real-number  square-root  program. 
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Example 

In  the  theory  of  the  nonnegative  real  numbers,  suppose  our  deduced  set  contains  the  sentence 
7 :  not(y2<a  and  not  (y  +  e)2<a  ), 

where  y  is  a  variable  and  a  and  e  are  constants.  (The  sentence  is  negated  because  it  is  deduced  from  the 
negation  of  the  original  theorem.) 

We  are  about  to  apply  the  resolution  rule  to  this  sentence  and  itself.  Therefore  let  us  produce  another 
copy  of  the  sentence  and  standardize  the  two  sentences  apart;  i.e.,  we  rename  the  variable  of  the  second 
sentence 


Q:  not  (  y2  <  a  and  not  ((y  +  e)2  <  a)). 

The  boxed  subsentences 

P  :  (y  +  e)2<a 

and 

P:  y2<  a 

are  unifiable,  with  most-general  unifier 

0:  {y<-y  +  c}. 

To  apply  the  rule,  we  replace  all  occurrences  of  P8  in  79  with  false,  replace  all  occurrences  of  ?9  in  Q6  with 
true,  and  take  the  disjunction  of  the  results,  obtaining 

not  (y2  <a  and  not  false) 
or 

not  (true  and  not  (((y  +  e)  +  e)2  <  a)). 

This  sentence  reduces  under  transformation  to 
not  (y2  <  a)  or  ((y  +  e)  +  c)2  <  a. 

The  above  application  of  the  rule  is  in  accordance  with  the  polarity  strategy,  because  the  boxed  sub¬ 
sentence  P  is  positive  in  7  and  the  boxed  subsentence  P  is  negative  in  Q. 


The  resolution  rule  presented  here  is  an  extension  of  the  rule  of  Robinson  [65]  to  the  nonclausal  case. 
Robinson’s  rule  applies  to  clauses  of  the  form 

7:  P  or  7' 

Q  :  ( not  P)  or 

where  P  and  P  are  unifiable  propositions,  with  most-general  unifier  9,  and  7'  and  Q'  are  themselves  clauses. 
Robinson’s  rule  deduces  the  new  sentence 

7'6  or  g'6. 

The  resolution  rule  presented  here  deduces,  from  the  same  sentences  7  and  Q,  the  new  sentence 

false  or  7'8 
or 

(not  true)  or  Q'6. 

This  sentence  reduces  under  transformation  to  (7' 8  or  Q'9),  the  same  sentence  deduced  by  Robinson’s  version 
of  the  rule. 


5.  The  Relation  Replacement  Rule 
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Nonclausal  resolution  was  developed  independently  by  Manna  and  Waldinger  [80]  and  Murray  [82].  The 
resolution  and  transformation  rules  together  have  been  shown  by  Murray  to  provide  a  complete  system  for 
first-order  logic.  An  implementation  of  a  nonclausal  resolution  theorem  prover  by  Stickel  [82]  employs  a 
connection  graph  strategy. 


5.  THE  RELATION  REPLACEMENT  RULE 


We  now  begin  to  extend  our  nonclausal  deduction  system  to  give  special  treatment  to  a  binary  relation 
-4.  The  two  new  rules  of  the  extension  allow  us  to  build  into  the  system  instances  of  the  polarity  replace¬ 
ment  proposition,  just  as  the  paramodulation  and  El-resolution  rules  allow  us  to  build  in  instances  of  the 
substitutivity  of  equality. 

Recall  that,  according  to  the  polarity  replacement  proposition,  for  any  sentence  P(x+ ,  y~)  and  binary 
relation  -4,  the  sentence 

if  x  -iy 

then  if  P(x+,  y~)  then  P(y+,  x~) 

is  valid. 

If  we  could  add  this  sentence  to  our  deduced  set  for  each  relevant  sentence  P(x+,  y~),  we  could  achieve  a 
considerable  abbreviation  of  the  proof,  at  the  cost  of  a  dramatic  explosion  of  the  search  space.  The  extended 
system  will  behave  as  if  the  sentences  were  present,  achieving  the  same  abbreviation  of  the  proof  and,  at  the 
same  time,  collapsing  rather  than  exploding  the  search  space. 

We  begin  with  the  relation  replacement  rule,  which  is  our  generalization  of  the  paramodulation  rule. 


THE  GROUND  VERSION 

With  respect  to  a  given  relation  -4,  the  rule  allows  us  to  replace  subexpression  occurrences  with  larger  or 
smaller  expressions,  depending  on  their  polarity.  The  ground  version  of  the  rule  which  applies  to  sentences 
with  no  variables,  is  as  follows: 


Rule  (relation  replacement,  ground  version) 


For  any  binary  relation  -4,  ground  expressions  s  and  t,  and  ground  sentences  7[s  -4  t ]  and  p(s+,  t  ), 
we  have 


7[s  -4  t] 
£{s+,  t~) 


7[false ]  or  ff(t+,  s  ). 

Here  $(t+,  s~)  is  obtained  from  §(s+ ,  t~)  by  replacing  certain  positive  occurrences  of  s  with  t 
and  replacing  certain  negative  occurrences  of  t  with  s,  where  polarity  is  taken  in  § (s+ ,  t~)  with 
respect  to  -4. 


In  other  words,  if  7\s  -4 1]  and  £{s+ ,  t~)  are  sentences  in  our  deduced  set,  we  can  add  to  the  set  the  sentence 
( 7{false ]  or  $(t+,  s~}). 

For  a  particular  relation  -4,  we  shall  refer  to  this  rule  as  the  -4-replacement  rule:  thus,  we  have  a  <- 
replacement  rule,  a  <-replacement  rule,  and  so  forth.  Although  the  rule  allows  us  to  replace  occurrences  in 
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$(s+,  t~)  of  both  expressions  s  and  t  at  the  same  time,  it  is  typically  applied  to  replace  occurrences  of  one  or 
the  other  expression,  but  not  both.  Subsequent  application  of  transformation  rules,  to  remove  occurrences 
of  the  truth  symbols  true  and  false,  may  be  regarded  as  part  of  the  relation  replacement  rule  itself. 

There  is  a  polarity  strategy  for  the  relation  replacement  rule,  which  allows  us  to  apply  the  rule  only  if 
some  occurrence  of  s  -4  t  is  positive  (or  of  no  polarity)  in  7[s  H  t]. 

Naturally  we  may  also  require  that  some  occurrence  of  s  or  t  is  actually  replaced;  otherwise,  ${t+ ,  s~) 
is  identical  to  ,p(s+,  t~),  and  the  sentence  we  obtain  is  ( 7[false ]  or  §{s+,  t~));  this  is  weaker  than  the 
sentence  §{s+,  t~ ),  which  was  already  in  the  deduced  set. 

In  illustrating  the  rule  we  draw  boxes  around  the  matching  occurrences  of  s  and  t. 


Example 


In  the  theory  of  the  nonnegative  integers,  suppose  our  deduced  set  contains  the  sentences 


and 


7 : 


if  P(s) 

then  ([  s  |  <  t)  + 


Note  that  the  boxed  occurrence  of  s  in  Q  is  positive  with  respect  to  the  less-than  relation  <.  Therefore  we 
can  apply  the  <-replacement  rule  to  replace  the  occurrence  of  s  in  Q  with  t,  to  deduce 


if  P(s) 

then  false 


or  s  <  t 2, 


which  reduces  under  transformation  to 


(notp(s))  or  s  <  t2. 

The  above  application  of  the  rule  is  in  accordance  with  the  polarity  strategy,  because  the  occurrence  of 
s  <  t  is  positive  in  7.  Note  that  not  every  occurrence  of  s  in  Q  was  replaced  in  applying  the  rule. 

In  a  system  without  the  relation  replacement  rule,  we  could  have  deduced  the  same  conclusion  by 
applying  the  resolution  rule  in  sequence  to  7,  Q,  the  monotonicity  property 

if  x  <  y 
then  x2  <  y2, 

and  the  transitivity  property 


if  x  <y 
then  if  y  <  z 

then  x  <  z. 

The  rule  allows  us  to  draw  the  conclusion  even  if  the  monotonicity  and  transitivity  properties  are  not  in  our 
deduced  set. 


The  following  illustration  of  the  rule  is  extracted  from  the  derivation  of  a  program  to  find  the  maximum 
element  of  a  list  of  numbers. 
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Example 

In  a  theory  of  lists  of  numbers  (integers,  say),  suppose  our  deduced  set  contains  the  sentences 

Uf  =  h 

j  .  no *  then  not  (m  <  |  h  |) 

or 

*-[] 

and 


Q  :  not 


if  g(h)  €  t 

then  g[h)  < 

h 

Note  that  the  boxed  occurrence  of  h  in  Q  is  negative  with  respect  to  <.  Therefore  we  can  apply  the 
oreplacement  rule  to  replace  the  occurrence  of  h  in  Q  with  m,  to  deduce 


not 


not 


if  g{m)  =  h 
then  not  false j 
or 

1 1  =  \\ 

or 

if  g[h)  e  t 
then  g(h)  <  m)J 

This  sentence  reduces  under  true-false  transformation  to 

*-[] 

or 

if  g[h)  €  t 
then  g{h)  <  m 

The  above  application  of  the  rule  is  in  accordance  with  the  polarity  strategy,  because  the  subsentence 
m  <  h  is  positive  in  7. 


not 


Let  us  now  establish  the  soundness  of  the  rule. 


Justification  (relation  replacement,  ground  version) 

We  show  that  the  given  sentences  T\s  ~t  t]  and  $(s+,  t~)  imply  the  conclusion  [T\false]  or  g(t+,  s~)). 
We  distinguish  between  two  cases  and  show  that  in  each  case  one  of  the  two  disjuncts,  7[false]  or  S(t+ ,  s  ), 
is  true. 

In  the  case  in  which  the  subsentence  s  -4  t  is  false,  we  know  (by  the  value  property,  because  s  and 
false  have  the  same  truth  value  and  7\s  t]  is  true)  that  the  first  of  the  disjuncts,  7[false],  is  true. 

In  the  case  in  which  s  -i  t  is  true,  we  know  (by  the  polarity  replacement  proposition,  because  ${s+ ,  t  ) 
is  true)  that  the  second  of  the  disjuncts,  5{t+ ,  s~),  is  true.  j 

As  with  the  resolution  rule,  we  have  established  the  soundness  of  the  ground  version  of  the  relation 
replacement  rule  when  applied  to  sentences  with  no  variables.  We  will  actually  apply  the  ground  version  of 
the  rule  to  sentences  with  variables.  The  above  justification  does  not  extend  to  this  case,  however,  because 
the  value  property  only  holds  for  ground  sentences.  Such  applications  are  an  instance  of  the  following  general 
version  of  the  rule. 
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THE  GENERAL  VERSION 

We  axe  now  ready  to  give  the  general  version  of  the  rule,  which  applies  to  sentences  with  variables  and 
allows  us  to  instantiate  the  variables  as  necessary  to  create  common  subexpressions. 


Rule  (relation  replacement,  general  version) 

For  any  binary  relation  -4,  expressions  s,  t,  7,  and  t,  and  sentences  7\s  -4  t]  and  9{7+ ,  t~),  where 
7  and  Q  are  standardized  apart,  we  have 

7[s  -4  t ] 

9(s+,t~) 

70[false]  or  §6(t6+,sd~) 

where  6  is  a  simultaneous,  most-general  unifier  of  s,7  and  of  t,t. 

More  precisely, 

•  7  has  one  or  more  subsentences  a  -4  t,  s i  -<  t±,  32  -4  t2,  .... 

•  Q  has  one  or  more  subexpressions  a,si,S2,  . . .  and  t,  £i,  £2,  •  •  •  • 

•  6  is  a  simultaneous  most-general  unifier  of  a,  si,  82,  .  .  . ,  7,7lt72,  ...  and  of  t,  t\,  t2,  .... 

/V  A/  A/ 

t)  j  ^2 j  •  •  •  5  li6n.cc 

ad  =  816  =  320  —  ...  =  70  =  8 \0  =  7 20  =  . . . 
and 

t0  =  tiO  =  t26  =  ...  =  t0  =  ti6  -  t20  = - 

•  The  conclusion  of  the  rule  is  obtained  by  replacing  all  occurrences  of  (s  -4  t)0  in  76  with 
false,  replacing  certain  positive  occurrences  of  sd  in  Qd  with  td,  replacing  certain  negative 
occurrences  of  td  in  §6  with  sd,  and  taking  the  disjunction  of  the  two  results.  Here  polarity 
is  in  £9  with  respect  to  -4. 

In  other  words,  we  apply  the  ground  version  of  the  rule  to  76  and  Qd. 


The  justification  of  the  general  version  of  the  rule,  which  we  omit,  is  straightforward  now  that  the 
soundness  of  the  ground  version  has  been  established.  The  proof  is  analogous  to  the  proof  of  the  general 
version  of  the  resolution  rule.  The  polarity  strategy  for  this  rule  allows  us  to  assume  that  at  least  one 
occurrence  of  the  subsentence  (s  -4  t)0  is  positive  or  of  no  polarity  in  70. 


Example 

In  the  theory  of  sets,  suppose  our  deduced  set  contains  the  sentences 


7: 


if  p(x) 


then  (  h[x,a)  C  6  )+  or  (  h(b,  y)  ]  C  |  x  |)  + 


and 


9  ■  (c  €  h{u,  a)  +  ~  v)  or  q(u,v), 


where  ~  is  the  set  difference  function. 
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Note  that 

•  7  contains  the  [positive]  subsentences  h(x,  a)  c  6  and  h(b,  y)  C  x. 

•  The  boxed  subterms  h(x,a),h(b,y),  and  h(u,a)  and  the  boxed  subterms  b  and  x  are 
simultaneously  unifiable,  with  most-general  unifier 

6  :  {x  <—  b,  u  +—  6,  y  *—  a}. 

•  The  boxed  occurrence  of  h(u,  a)  is  positive  in  Q  with  respect  to  C. 

Therefore  we  can  apply  the  C-replacement  rule,  replacing  all  occurrences  of  h(b,  a)  C  b  in  76  with  false, 
replacing  the  occurrence  of  h[b,  a)  in  Q6  with  b,  and  taking  the  disjunction  of  the  results,  to  obtain 

if  p(b) 

then  false  or  false 
or 

(c  €  b  ~  t>)  or  q(b,  v). 

This  sentence  reduces  under  transformation  to 
(raofp(6))  or  (c  €  b  ~  u)  or  q(b,v). 

The  above  application  of  the  rule  is  in  accordance  with  the  polarity  strategy. 

Use  of  the  relation  replacement  rule  allows  a  dramatic  abbreviation  of  many  proofs.  For  this  reason 
and  because  the  rule  enables  us  to  eliminate  troublesome  axioms  from  the  deduced  set,  the  search  space 
is  constricted.  We  have  not  established  completeness  results  for  the  rule;  judging  from  the  corresponding 
theorem  for  paramodulation  (Brand  [75]),  we  expect  such  results  to  be  difficult. 


SPECIAL  CASE:  THE  EQUALITY  REPLACEMENT  RULE 

The  most  important  instance  of  the  relation  replacement  rule  is  obtained  by  taking  the  relation  -4  to 
be  the  equality  relation  =.  This  special  case  of  the  rule,  which  allows  us  to  replace  equals  with  equals,  is  a 
nonclausal  version  of  the  paramodulation  rule.  It  may  be  expressed  as  follows: 


Rule  (equality  replacement) 

For  any  terms  s,  t,  s,  and  t,  and  sentences  7[s  =  t]  and  $(s,  t),  where  7  and  Q  are  standardized 
apart,  we  have 

7[s  =  t) 

9{s,7) 

76\false]  or  Q9(t9,sd) 

where  $  is  a  simultaneous,  most-general  unifier  of  s,  s  and  of  t,  t. 


The  notation  is  analogous  to  that  for  the  general  relation-replacement  rule.  We  do  not  need  to  restrict 
the  polarity  of  the  replaced  subterms  s8  and  td  in  §6,  because  any  term  has  both  polarities  with  respect  to 
the  equality  relation.  The  polarity  strategy  is  the  same  as  before. 

The  following  illustration  of  the  equality  replacement  rule  is  extracted  from  the  derivation  of  an  integer 
quotient  program. 


86 


5.  The  Relation  Replacement  Rule 


Example 

In  the  theory  of  the  nonnegative  integers,  suppose  our  deduced  set  contains  the  sentences 
7 (|  0-it  |  =  0)+ 

and 

Q  :  not  ( |  z  •  d  |  <  n  and  [z  +  l)  •  d  >  n) . 

(In  the  derivation,  7  is  an  axiom  and  Q  is  deduced  from  the  negation  of  the  theorem.) 

Note  that 

•  7  contains  the  (positive)  subsentence  0  •  u  =  0. 

•  The  boxed  subterms  0  •  u  and  z  ■  d  are  unifiable,  with  most-general  unifier 

6  :  {z  *—  0,  u  «—  d}. 

Therefore  we  can  apply  the  =-replacement  rule,  replacing  all  occurrences  of  0  ■  d  =  0  in  J0  with  false, 
replacing  the  occurrence  of  0  •  d  in  Q8  with  0,  and  taking  the  disjunction  of  the  results,  to  deduce 

false 

or 

not  (0  <  n  and  (0  +  l)  ■  d  >  n). 

This  sentence  reduces  under  true- false  transformation  to 
not  (0  <  n  and  (0  +  1)  •  d  >  n) .  ^ 

SPECIAL  CASE:  THE  EQUIVALENCE  REPLACEMENT  RULE 

Another  important  instance  of  the  relation  replacement  rule  is  obtained  by  taking  the  relation  to  be 
the  equivalence  connective  =.  This  is  possible  only  because  we  regard  connectives  as  relations  over  truth 
values.  The  rule  is  analogous  to  the  equality  replacement  rule. 

Rule  (equivalence  replacement  rule) 

For  any  sentences  5,  T,  S,  T,  7[S  =  T],  and  Q(S,  T),  where  7  and  Q  are  standardized  apart,  we  have 

7[S  =  T] 

$(lf) 

70\false j  or  §6{T6,  $9) 

where  6  is  a  simultaneous,  most-general  unifier  of  S,  S  and  of  T,  T. 

As  in  the  equality  replacement  rule,  we  do  not  need  to  restrict  the  polarities  of  the  replaced  subsentences 
S9  and  T 8  in  Q8,  because  any  subsentence  has  both  polarities  with  respect  to  the  equivalence  relation.  The 
polarity  strategy  is  the  same  as  for  the  general  relation-replacement  rule. 

The  following  illustration  of  the  equivalence  replacement  rule  (or  =-replacement  rule)  is  drawn  from  the 
derivation  of  a  program  to  find  the  maximum  of  a  list  of  numbers  (e.g.,  integers  or  reals). 


6.  The  Relation- Matching  Rule 


Example 


In  the  theory  of  lists  of  (say)  integers,  suppose  our  deduced  set  contains  the  sentences 
if  not  (x  =  {  }) 


then  u  g  x  =  [u  =  h  or  u  g  t]J 


z  g  a  and 

Q  :  not  if  g{z)  €  a 
[then  z  >  g[z) 

(In  the  derivation,  7  is  an  axiom  and  $  is  deduced  from  the  negation  of  the  theorem.) 

Note  that  the  boxed  subsentences  it  g  x  and  g(z)  g  s  are  unifiable,  with  most-general  unifier 
d  :  {it  *  g(z),  x  *—  s). 

Therefore  we  can  apply  the  =-replacement  rule,  replacing  the  occurrence  of  g{z)  6  a  in  §6  with 
g[z)  =  h  or  g(z)  €  t, 

to  deduce 

if  not  (s  =  {  }) 

then  false 
or 

z  €  s  and 

not  if  \g{z)  =  h  or  g(z)  €  t] 
then  z  >  g(z) 

This  sentence  reduces  under  transformation  to 

s  =  {} 


z g  s  and 

not  *7  \g{z)  =  h  or  g(z)  g  t] 
then  z  >  g(z) 


6.  THE  RELATION-MATCHING  RULE 

We  are  about  to  introduce  not  a  rule  in  itself  but  an  augmentation  of  the  other  rules.  The  resolution  and 
relation  replacement  rules  draw  a  conclusion  when  one  subexpression  in  our  proof  unifies  with  another.  The 
relation-matching  augmentation  allows  these  rules  to  apply  even  if  the  two  expressions  fail  to  unify,  provided 
that  certain  conditions  can  be  introduced  into  the  conclusion.  We  begin  by  describing  the  augmentation  of 
the  resolution  rule. 


RESOLUTION  WITH  RELATION  MATCHING:  GROUND  VERSION 


This  rule  is  our  generalization  of  the  E-resolution  rule.  The  ground  version  of  the  rule  is  as  follows: 
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Rule  (resolution  with  relation  matching,  ground  version) 

For  any  binary  relation  ground  expressions  s  and  t,  and  ground  sentences  P(s+,  t+,  s~ ,  t~), 
7[P(s+ ,  s+ ,  t~,  t~)],  and  S\P  (t+ ,  t+ ,  s~ ,  s~)]  we  have 

7[P(s+,  s+,  t~,  t-)] 

Q[P{t+,  t+,  s~,  a")] 


if  s  ^  t 

then  7[false\  or  Q[true\ 

Here 


•  P{s+ ,  t+,  s  ,  t  )  is  an  arbitrary  sentence,  called  the  intermediate  sentence,  which  may 
have  positive  and  negative  occurrences  of  s  and  f;  polarity  is  taken  with  respect  to  -i. 

•  The  sentence  7  may  have  several  distinct  subsentences  P(s+ ,  s+,  t~,  t~),  each  obtained 
from  the  intermediate  sentence  P(s+ ,  t+ ,  s~ ,  t~)  by  replacing  certain  of  the  positive 
occurrences  of  t  with  a  and  certain  of  the  negative  occurrences  of  s  with  t. 

•  Similarly,  Q  may  have  several  distinct  subsentences  P(t+ ,  t+ ,  s~ ,  s~),  each  obtained  from 
the  intermediate  sentence  by  replacing  certain  of  the  positive  occurrences  of  s  with  t  and 
certain  of  the  negative  occurrences  of  t  with  s. 

For  a  particular  relation  we  shall  refer  to  the  above  as  the  resolution  rule  with  -<-matching. 

Note  that  if  all  the  subsentences  P{s+ ,  s+ ,  t~ ,  t~)  and  P(t+ ,  t+ ,  s~ ,  s~)  were  identical,  we  could 
apply  the  original  resolution  rule,  obtaining  the  conclusion  (7[false\  or  §{true\).  The  augmented  rule  allows 
us  to  derive  the  same  conclusion  rule  even  if  the  subsentences  P  do  not  match  exactly,  provided  that  the 
mismatches  occur  between  terms  a  and  t  of  restricted  polarity  and  that  the  condition  a  -4  t  is  introduced. 

The  polarity  strategy  allows  us  to  apply  the  rule  only  if  an  occurrence  of  one  of  the  sentences  P  (a+ ,  a+ ,  t~ ,  t 
is  positive  or  of  no  polarity  in  7  and  if  an  occurrence  of  one  of  the  sentences  P(t+,  t+,  a-,  a-)  is  negative 
or  of  no  polarity  in  Q. 

Note  that  the  intermediate  sentence  P(s+,  t+,  s~,  t~)  does  not  necessarily  appear  in  either  of  the 
sentences  of  the  deduced  set  and  that  the  rule  does  not  stipulate  how  to  find  such  a  sentence.  We  shall 
discuss  the  choice  of  the  intermediate  sentence  in  the  subsection  Selection  of  Application  Parameters. 


Example 


In  the  theory  of  lists,  suppose  that  our  deduced  set  includes  the  sentences 


7  :  p{£)  or  c  €  (tail(l)) 


r+ 


and 


5 : 


if 


cel+ 


then  g(£). 


The  two  boxed  subsentences  are  not  identical.  Let  us  take  our  intermediate  sentence  to  be  one  of  them, 
P  :  c  S  tail(i).  The  subterm  a+  :  tail(i)  is  positive  in  c  €  tail(l)  with  respect  to  the  proper-sublist  relation 
-<u,f  The  other  boxed  subsentence  c  &  l  can  be  obtained  by  replacing  this  subterm  with  t+  :  l.  Therefore 
we  can  apply  the  resolution  rule  with  -<;jat-matching  to  obtain 


if  tail(i)  <ii)t  £ 
then  p(l)  or  false 
or 

if  true  then  q{i), 
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which  reduces  under  transformation  to 

if  tail{l )  <u,t  t 
then  p(£)  or  q{&). 


We  shall  give  some  more  complex  examples  of  the  application  of  the  rule  after  we  establish  its  soundness. 


Justification  (resolution  with  relation  matching,  ground  version) 

Note  that  (by  the  invertibility  of  partial  replacement)  the  intermediate  sentence  P(s+,  t+,  s~ ,  t~)  can 
be  obtained  from  any  of  the  subsentences  P(s+ ,  s+,  t~,  t~)  of  7  by  replacing  certain  positive  occurrences 
of  s  with  t  and  certain  negative  occurrences  of  t  with  a,  where  polarity  is  taken  in  P  with  respect  to  -i. 
Therefore  (by  the  polarity  replacement  proposition)  each  of  the  sentences 

if  s  ^  t 

(f)  then  if  P(s+,  s+ ,  t~ ,  t~) 

then  P(s+,  t+ ,  s~ ,  t~ ) 


is  valid. 

Also  any  of  the  subsentences  P(t+,  t+,  a-,  s~)  of  Q  can  be  obtained  from  the  intermediate  sentence 
P(a+,  t+,  a-,  t~)  by  replacing  certain  positive  occurrences  of  a  with  t  and  certain  negative  occurrences  of 
t  with  a.  Therefore  (by  the  polarity  replacement  proposition  again)  each  of  the  sentences 

if  s  -<t 

(t)  then  if  P{s+,  t+ ,  s~ ,  t~) 

then  P(t+,  t+,  a-,  a-) 

is  valid. 

Suppose  that  the  sentences  7\P{s+,  a+,  t~ ,  f-)]  and  Q[P{t+ ,  t+,  a-,  a-)]  are  true  and  that  a  ^  t. 
We  would  like  to  show  that  then  (/[/alae]  or  §[true})  is  true.  The  proof  distinguishes  between  two  cases, 
depending  on  whether  the  intermediate  sentence  P(s+,  t+ ,  s~ ,  t~)  is  false  or  true.  We  show  that  in  each 
case  one  of  the  two  disjuncts,  7[false ]  or  §[true\,  is  true. 


Case :  P(s+ ,  i+,  s  ,  t  )  is  false 

Then  by  our  previous  conclusion  (f),  because  a  ^  t,  we  know  each  of  the  subsentences  P(s+,  a+,  t~ ,  t~) 
of  /  is  false.  Because  7\P{s+ ,  a+,  t~,  f-)]  is  true  and  because  the  subsentences  P(s+,  a+,  t~,  t~)  and 
false  all  have  the  same  truth  value,  we  know  (by  the  value  property)  that  the  first  disjunct,  7[false],  is  true. 


Case:  P{s+,  f+,  a  ,  t  )  is  true 

Then  by  our  previous  conclusion  (J),  because  a  ^  t,  we  know  each  of  the  sentences  P(t+ ,  t+ ,  a-,  a") 
is  true.  Because  $\P {t+ ,  t+,  s~ ,  a-)]  is  true  and  because  P(t+,  t+ ,  a-,  a-)  and  true  have  the  same  truth 
value,  we  know  (by  the  value  property  again)  that  the  second  disjunct,  Q{true),  is  true. 


The  resolution  rule  with  relation  matching  must  be  regulated  with  strict  heuristic  controls;  if  the  controls 
are  too  permissive,  any  two  subsentences  may  be  matched. 

The  following  example  is  a  bit  contrived  but  illustrates  some  of  the  power  of  the  rule. 
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Example 

In  the  theory  of  sets,  suppose  our  deduced  set  includes  the  two  sentences 

+ 

T-.  or 

+ 


and 

Q :  not 


Let  us  take  our  intermediate  sentence  to  be 

P  :  e  €E  ((s+~a)u(6~a-)u(f+~c)u(d~f-)). 

The  occurrences  of  s  and  t  have  been  annotated  with  their  polarities  in  ?  with  respect  to  the  proper-subset 
relation  C.  Note  that  each  of  the  boxed  sentences  in  7  may  be  obtained  from  P  by  replacing  certain  of  the 
positive  occurrences  of  t  with  s  and  certain  of  the  negative  occurrences  of  s  with  t.  Also,  each  of  the  boxed 
subsentences  of  §  may  be  obtained  from  P  by  replacing  certain  of  the  positive  occurrences  of  s  with  t  and 
certain  of  the  negative  occurrences  of  t  with  s.  Therefore  we  can  apply  the  resolution  rule  with  C-matching 
to  obtain 

if  s  Ct 

then  false  or  false 
or 

not  ( true  and  true)i 

which  reduces  under  transformation  to  the  sentence 
not  (j  C  t), 

Note  that  this  conclusion,  obtained  by  a  single  application  of  the  rule,  is  not  immediately  evident  to  the 
human  reader. 


SPECIAL  CASE:  RESOLUTION  WITH  EQUALITY  MATCHING 

In  the  case  in  which  the  relation  -<  is  taken  to  be  the  equality  relation  =,  the  resolution  rule  with 
relation  matching  reduces  to  a  nonclausal  variant  of  the  E-resolution  rule.  It  may  be  expressed  (in  the 
ground  version)  as  follows: 

Rule  (resolution  with  equality  matching) 

For  any  terms  s  and  t  and  sentences  P{s,t,s,t),  7[P{s,  s,t,t,)],  and  $[P{t,  t,  s,  a)],  we  have 

7[P(s,s,t,t)] 

Q[P(t,  i,s,s)j 

if  s  =  t 

then  7[false\  or  Q[true\. 


e  €  ((t+  ~  o)  U  (6  ~  s")  U  ( t+  ~  c)  U  (d  ~  f-)) 
and 

e  €E  (($"*"  ~  a)  U  (6  ~  s~)  U  (t+  ~  c)  U  (d ~  s~ )) 
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Here  P{s,  a,  t,  t)  and  P  (t,  t,  a,  a)  are  obtained  from  P(s,  t,  a,  t)  by  replacing  certain  occurrences  of  s  with  t  and 
certain  occurrences  of  t  with  a.  In  other  words,  all  the  subsentences  P(3,s,  t,  t)  and  P  ( t ,  t,s,s)  are  identical 
except  that  one  may  have  occurrences  of  a  where  another  has  occurrences  of  t.  We  do  not  need  to  restrict 
the  polarities,  because  every  subterm  of  a  sentence  is  both  positive  and  negative  with  respect  to  the  equality 
relation. 


MULTIPLE  MISMATCHED  SUBSENTENCES 

The  resolution  rule  with  relation  matching  can  be  extended  to  allow  several  corresponding  pairs  of 
subexpressions  ai,  ti,  a 2,  t?,  ...  and  sn,  tn  rather  than  a  single  pair  a,  t,  and  several  binary  relations  -*i,  -<2 
,  . . . ,  and  rather  than  a  single  binary  relation  -4.  To  write  the  extended  rule  succinctly,  we  abbreviate 
Si,  a 2,  . .  • ,  An  as  s,  f|,  t2,  .  •  • ,  tn  t,  “^1,  . . . ,  and  as  and 

si  ti  and  a 2  ^2  *2  and  . . .  and  sn  as  3  ^  t. 

Then  for  any  binary  relations  expressions  3  and  t,  and  sentences  P(S+,t+,a~  ,t~),  f[P (a+,  3+  ,t~ ,  t-)] , 
and  Q  [P  ( t+ ,  t+ ,  a~ ,  a~ )] ,  we  have 

?[p{3+,§+,i-,i-)] 

9[P(t+,t+,§-,s-)] 

if  a  ^  t 

then  7[falae\  or  Q\true\. 

The  extended  rule  is  easily  justified,  given  the  soundness  of  the  original  rule. 


RESOLUTION  WITH  RELATION  MATCHING:  GENERAL  VERSION 

The  general  version  of  the  rule  allows  us  to  instantiate  the  variables  of  the  given  sentences  as  necessary 
and  then  to  apply  the  ground  version.  The  precise  statement,  which  we  omit,  is  analogous  to  the  precise 
statement  of  the  general  version  of  the  resolution  rule.  We  illustrate  the  application  of  the  general  rule  with 
an  example. 


Example 


Suppose  our  deduced  set  contains  the  sentences 


and 


7: 


if  q(u) 


then 


9  :  not 

Here  the  annotations  of  the  subterms  within  the  boxed  subsentences  indicate  their  polarity  in  these  subsen¬ 
tences  with  respect  to  a  binary  relation  -4. 


p(£+,  /(£)+) 


The  substitution  9  :  {u  *—  1}  fails  to  unify  the  boxed  subsentences  of  7  and  9>  the  results  of  applying  9 
to  these  subsentences  are  the  sentences  p(£+,£+)  and  p(£+,  /(£}+),  respectively.  Note  that  the  mismatched 
occurrences  of  £  and  /(£)  are  positive  in  these  sentences  with  respect  to  -4. 
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To  apply  the  ground  version  of  the  rule  to  76  and  §9,  let  us  take  the  intermediate  sentence  to  be 
p(£+,£+).  We  obtain 

if  t  d  m 

then  ,  or  (not  true), 

then  false 

which  reduces  under  true- false  transformation  to 

if  t  <  m 

then  notq(i). 


SELECTION  OF  APPLICATION  PARAMETERS 

For  each  application  of  the  resolution  rule  with  relation  matching,  we  must  select  the  application  pa¬ 
rameters,  i.e.,  the  substitution  8,  the  intermediate  sentence  P,  and  the  subexpressions  a  and  t.  In  fact,  a 
satisfactory  choice  of  application  parameters  is  not  straightforward:  it  depends  on  what  other  sentences  are 
in  the  deductive  set.  Some  considerations  influencing  the  decision  are  illustrated  in  the  next  few  sections. 

Choice  of  Substitution 

The  substitution  8  and  the  intermediate  sentence  P  for  applying  the  rule  are  not  necessarily  unique. 

In  the  example  above,  consider  again  the  boxed  subsentences  p(u+,u+)  and  p(£+,/(£)+)  of  7  and  Q. 
Instead  of  the  substitution  8  :  {u  «—  £},  consider  the  substitution  8'  :  {u  *—  /(£)}.  This  substitution  also 
fails  to  unify  the  boxed  subsentences;  the  results  of  applying  6'  to  the  boxed  subsentences  are  the  sentences 
p{f{i)+> /M+)  and  p(£+ ,  /(£)+) ,  respectively.  Note  that  the  mismatched  occurrences  of  /(£)  and  £  are 
positive  in  these  sentences  with  respect  to 

To  apply  the  ground  version  of  the  rule  to  76'  and  Q8' ,  let  us  take  the  intermediate  sentence  to  be 
p(/(£)+,  /(£)+).  We  obtain 

if  m  *  t 

then  or  (not  true), 

then  false 

which  reduces  under  true-false  transformation  to 
if  /(£)  d  £ 

then  notq(l). 

This  is  not  equivalent  to  the  sentence  we  obtained  by  applying  the  rule  with  the  substitution  8, 

if  ^  m 

then  notq(i). 

In  other  words,  we  must  consider  both  ways  of  applying  the  rule. 

To  Unify  or  Not  to  Unify 

In  previous  examples,  we  have  applied  the  resolution  rule  with  relation  matching  only  when  it  is  illegal 
to  apply  the  ordinary  resolution  rule  because  the  matched  subsentences  fail  to  unify.  In  some  cases,  however, 
we  must  use  relation  matching  to  obtain  a  refutation  even  though  the  matched  subsentences  do  unify  and 
the  resolution  rule  could  be  applied. 
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For  example,  suppose  our  deduced  set  consists  of  the  sentences 


1.  p(z+)  or  q{x+) 


2. 

3. 


4.  c  a 


5.  c  -<b, 

where  x  is  positive  in  the  boxed  subsentence  p(z)  and  in  the  subsentence  q(x)  with  respect  to  the  relation 
as  indicated  by  its  annotation. 

It  is  legal  to  apply  the  ordinary  resolution  rule  to  the  first  two  sentences,  taking  the  unifier  to  be 
{x  <—  a},  to  deduce  (after  transformation) 

$(«)• 

However,  this  sentence  is  of  no  use  in  a  refutation. 

If  instead  we  apply  the  resolution  rule  with  -^-matching  to  the  same  boxed  subsentences,  taking  the 
unifier  to  be  the  empty  substitution  {  },  we  obtain  (after  transformation) 

6.  if  x  H  a  then  q[x+)  +. 

We  can  then  apply  the  resolution  rule  to  sentences  6  and  3,  taking  the  unifier  to  be  the  empty  substitution 
{  },  to  obtain  (after  transformation) 

7.  if  x  d  b  then  not  (x  ^  a). 

We  finally  obtain  a  refutation  by  applying  the  resolution  rule  to  this  sentence  and  the  last  two  sentences  in 
turn;  the  unifier  is  {z  +—  c}. 

In  applying  the  ordinary  resolution  rule,  we  committed  x  to  be  a;  this  turned  out  to  be  a  mistake.  In 
applying  the  resolution  rule  with  -^-matching  instead,  we  left  x  free  to  be  any  element  such  that  x  ^  o;  in 
particular,  we  could  then  take  x  to  be  c. 

Choice  of  Mismatched  Subexpressions 

In  the  examples  of  resolution  with  relation  matching  we  have  seen,  we  have  always  taken  the  mismatched 
subexpressions  s  and  t  to  be  as  small  as  possible.  Sometimes  this  choice  costs  us  a  proof. 

For  instance,  suppose  our  deduced  set  consists  of  the  sentences 


3.  f(a)  =  f(b). 

If  we  apply  the  resolution  rule  with  equality  matching  to  the  first  two  sentences,  taking  s  to  be  a  and  t 
to  be  b,  we  obtain 

if  a  =  b 

then  false  or  not  true, 


which  reduces  under  transformation  to 
not  (a  —  6). 
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This  sentence  is  of  no  use  in  a  refutation. 

On  the  other  hand,  if  instead  we  apply  the  same  rule  taking  s  to  be  /(a)  and  t  to  be  f(b),  we  obtain 

if  f{<*)  =  f(b) 

then  false  or  not  true, 
which  reduces  under  transformation  to 
not  (/(o)  =  /(&)). 

A  refutation  can  be  obtained  immediately  by  applying  the  resolution  rule  to  the  third  sentence  and  this  one. 

In  the  preceding  examples,  we  have  seen  that  in  applying  the  resolution  rule  with  relation  matching, 
the  choice  of  appropriate  application  parameters,  i.e.,  the  substitution  6,  the  intermediate  sentence  P,  and 
the  mismatched  subexpressions  s  and  t,  are  not  unique  and  depend  on  the  other  sentences  in  the  deduced 
set.  Digricoli  [83]  provides  an  algorithm  to  generate  all  legal  sets  of  application  parameters.  This  algorithm 
is  phrased  in  terms  of  his  variant  of  the  E-resolution  rule  but  extends  readily  to  the  general,  nonclausal 
case.  Digricoli  also  suggests  a  heuristic  viability  criterion  for  selecting  a  single  appropriate  set  of  application 
parameters;  this  criterion  appears  to  extend  to  the  general  case  as  well. 


REPLACEMENT  WITH  RELATION  MATCHING:  GROUND  VERSION 

We  have  shown  how  to  augment  the  resolution  rule  to  apply  even  if  the  matched  subsentences  are  not 
entirely  unified  by  the  substitution.  We  now  introduce  an  analogous  augmentation  of  the  relation  replacement 
rule. 


Rule  (replacement  with  relation  matching,  ground  version) 

For  any  binary  relations  Hi  and  H2,  ground  expressions  3,  t,  u(s+  ,t+,  s~,  t~),  and  v(s+  ,t+ ,  s~  ,t~), 
and  ground  sentences 

J[u(s+,  a+,  t~,  t~)  Hi  v<s+,  s+,  t~,  t~)] 

and 


9{u{t+,  t+,  S  ,3  )  +  ,  v(t+,  t+,  3  ,  3  )  }, 
we  have 

7\u{s+ ,  S  +  ,  t~ ,  t~)  Hi  v(s+,  s+,  t~,  t-)] 

9(u{t+,  t+,  S~,  S-)  +  ,  v(t+,  t+,  S~,  3-)-) 

if  S  -<2  t 

then  7[false ]  or  Q{v(t+ ,  t+,  s~ ,  s-)+,  u(t+,  t+ ,  s~,  s~)~) 

Here 


•  The  expressions  u(s+,  t+,  s  ,  t  )  and  v(s+,  t+,  s  ,  t  )  are  arbitrary  expressions.  The 
sentence  u(s+,  t+ ,  s~ ,  t~)  Hi  v(s+ ,  t+,  s~ ,  f~)  is  called  the  intermediate  sentence. 

•  The  subsentences  u(s+,  s+,  t~ ,  t~)  Hi  v(s+,  s+,  t~,  t~)  of  7  are  obtained  from  the 
intermediate  sentence  by  replacing  certain  positive  occurrences  of  t  with  s  and  certain 
negative  occurrences  of  s  with  t,  where  polarity  is  taken  in  the  intermediate  sentence  with 
respect  to  Ho- 

•  The  subexpressions  u{t+,  t+ ,  s~ ,  s~)  and  v{t+ ,  t+,  s~ ,  s~)  of  Q  are  obtained  from 
u(s+,  t+,  s~,  t~)  and  v(s+ ,  t+,  s~ ,  t~),  respectively,  by  replacing  certain  occurrences  of  s 
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with  t  and  certain  occurrences  of  t  with  a,  where  again  polarity  is  taken  in  the  intermediate 
sentence  with  respect  to  -<2- 

•  The  subsentence  S{v{t+ ,  t+,  s~ ,  a~)+,  u(t+,  t+,  s~,  s~)~)  of  the  conclusion  is  obtained 
from  @(u(t+,  t+,  s~ ,  s~)+,  v(t+,  t+,  s~ ,  s~)~)  by  replacing  certain  positive  occur¬ 
rences  of  u(t+,  t+,  s~,  s~)  with  v(t+ ,  t+ ,  s~,  s~)  and  certain  negative  occurrences  of 
v(t+,  t+,  s~ ,  s~)  with  u(t+,  t+,  s~ ,  s~),  where  the  polarity  of  u  and  v  is  taken  in  §  with 
respect  to  -«i. 


For  particular  binary  relations  -<i  and  -<2,  we  shall  call  this  the  -«i -replacement  rule  with  -42-matching. 
Note  that  if  tt(f+,  t+,  s~ ,  s~)  and  v(t+ ,  t+,  s~ ,  s~)  were  identical  to  u(s+,  s+,  t~ ,  t~)  and  v(s+,  s+,  t~,  t~), 
respectively,  we  could  apply  the  original  -<1  -replacement  rule  without  -<2-matching,  obtaining  the  conclusion 

7[false]  or  $(v(t+,  t+ ,  s~ ,  s-)+,  u(t+,  t+,  s~ ,  s~)~). 

The  augmented  rule  allows  us  to  derive  the  same  conclusion,  even  if  the  subexpressions  do  not  match  exactly, 
provided  that  the  mismatches  occur  between  subexpressions  s  and  t  of  restricted  polarity  with  respect  to 
-<2  and  that  the  condition  s  t  is  added. 


Example 

In  a  theory  that  includes  the  lists  and  the  integers,  suppose  our  deduced  set  Contains  the  sentences 


7  :  (  length(m  )  <  a)  or  p(m) 


and 


§  :  if  q(£)  then  (  length(£  )  +  >  b), 


where  i  and  m  are  lists  and  a  and  b  are  integers. 


The  two  boxed  subexpressions  are  not  identical,  so  we  cannot  apply  the  original  <-replacement  rule. 
To  apply  the  augmented  rule,  let  us  take  our  intermediate  sentence  to  be  length(£)  <  a.  With  respect  to  the 
proper  sublist  relation  the  subterm  s~  :  t  is  negative  in  the  intermediate  sentence  u  -4\  v  :  length[l)  <  a. 
From  this  sentence  we  can  obtain  the  subsentence  length{m)  <  a  of  7  by  replacing  the  negative  occurrence 
of  L  with  t~  :  m.  Therefore,  by  the  <-replacement  rule  with  -<;{«t-matching,  we  deduce 


■if  £  —list  m 
then  false  or  p(m) 
or 

if  q{l)  then  a  >  6, 

Here  the  subsentence  a  >  b  of  the  conclusion  is  obtained  from  the  subsentence  length(i)  >  b  of  p  by  replacing 
a  positive  occurrence  of  ti+  :  length(£)  with  v+  :  b,  where  polarity  is  taken  in  Q  with  respect  to  the  weak 
less-than  relation  <.  The  conclusion  reduces  under  transformation  to 


if  £  —list  m 

then  p(m)  or 

if  q{£)  th-en  a  >  b. 

Now  let  us  establish  the  soundness  of  the  rule. 


Justification  (replacement  with  relation  matching,  ground  version) 

Note  that  (by  the  invertibility  of  partial  replacements),  the  intermediate  sentence  u(s+ ,  t+ ,  s~ ,  t~)  -<1 
«{s+,  t+,  s-,  t~  ')  can  be  obtained  from  any  of  the  subsentences  u(s+,  s+,  t  ,  t  )  -<1  v(s+,  s+,  t  ,  t  )  of 
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7  by  replacing  certain  positive  occurrences  of  s  with  t  and  certain  negative  occurrences  of  t  with  s,  where 
polarity  is  taken  in  the  subsentences  with  respect  to  -«2.  Therefore  (by  the  polarity  replacement  proposition), 
each  of  the  sentences 

if  s  r<2 1 

(f)  then  if  u{$+ ,  s+,  t~,  t~)  -<i  u(s+,  s+,  t~,  t~) 

then  u(s+,  t+,  s~ ,  t~)  v(s+,  t+,  s~,  t~) 

is  valid. 

Also  any  of  the  sentences  u(t+,  t+,  s~ ,  s~)  -<i  v(t+,  t+,  s~,  s~)  can  be  obtained  from  the  intermediate 
sentence  t i(s+,  t+,  s~ ,  t~)  -<i  o(s+ ,  t+,  s~ ,  t~)  by  replacing  certain  positive  occurrences  of  a  with  t  and 
certain  negative  occurrences  of  t  with  s,  where  polarity  is  taken  in  the  intermediate  sentence  with  respect  to 
-42.  Therefore  (by  the  polarity  replacement  proposition  again)  each  of  the  sentences 

if  s  r*2  t 

($)  then  if  u(s+ ,  f+,  s~ ,  t~)  -«i  u(s+,  t+,  s~,  t~) 

then  u(t+,  t+ ,  s~ ,  s~)  -<i  v(t+,  t+,  s~,  s~) 

is  valid. 

Furthermore  the  subsentence  §(v(t+,  t+,  s~ ,  a-)+,  u(t+,  t+,  s~ ,  s~)~)  of  the  conclusion  can  be 
obtained  from  the  given  sentence  §{u(t+ ,  t+,  s~ ,  s~)+,  v(t+ ,  t+,  s~ ,  a~)~)  of  the  deduced  set  by  replacing 
certain  positive  occurrences  of  u(t+,  t+,  s~ ,  s~)  with  v(t+ ,  t+ ,  s~ ,  s~)  and  certain  negative  occurrences  of 
v(t+,  t+,  s~ ,  s~)  with  u(t+,  t+ ,  s~,  s~),  where  polarity  is  taken  in  Q  with  respect  to  -<i.  Therefore  (by 
the  polarity  replacement  proposition  once  again)  each  of  the  sentences 

if  u(t+,  t+ ,  s~ ,  s~)  v(t+,  t+,  s~ ,  s~) 

(ft)  then  if  S(u(t+,  t+,  s~ ,  s~)+ ,  v(t+,  t+,  s',  s“» 

then  Q{v{t+ ,  t+,  s~ ,  s~)+,  «(t+,  t+ ,  s~ ,  s~)~) 

is  valid. 

Suppose  that  the  ground  sentences 

7[u(s+,  a+,  t~,  t~)  -<i  u(s+,  s+,  t~,  t~)]  and  9iu(t+>  t+>  s~ >  s~)+ >  vit+  >  t+>  s~>  s~)~ ) 

are  true  and  that  s  -<2  We  would  like  to  show  that  then 

7\false]  or  $(v(t+,  t+,  s~ ,  s~)+,  u(t+,  t+ ,  s~,  s~)~) 

is  true.  The  proof  distinguishes  between  two  cases,  depending  on  whether  the  intermediate  sentence  is  false  or 
true.  We  show  that  in  each  case  one  of  the  two  disjuncts,  7[false ]  or  §(v(t+,t+,  s~ ,  s~)+  ,u(t+ ,  t+ ,  s~ 
is  true. 


Case:  u{s+ ,  t+,  s  ,  t  )  -<i  v(s+ ,  t+ ,  s  ,  t  )  is  false 

Then  by  our  previous  conclusion  (f),  because  s  we  know  each  of  the  subsentences  u(s+ ,  s+  ,t~  ,t~)  -<x 
u(s+,  s+,  t~ ,  t~)  of  7  is  false.  Because  7\u(s+,  s+ ,  t~,  t~)  -<i  u(s+,  s+,  t~ ,  t-)]  is  true  and  because  the 
sentences  u(s+,  s+,  t~,  t~)  -<i  u(s+,  s+,  t~ ,  t~)  and  false  all  have  the  same  truth  value,  we  know  (by  the 
value  property)  that  the  first  disjunct,  7\false],  is  true. 


Case:  u(s+,  t+ ,  s  ,  t  )  -<!  u(s+,  t+,  s  ,  t  )  is  true 

Then  by  our  previous  conclusion  (f),  because  s  -<2  t,  we  know  each  of  the  sentences  u(t+,  t+,  s~ ,  s~)  -<i 
v(t+,  t+,  S-,  s  )  is  true.  Therefore  by  several  applications  of  our  previous  conclusion  (ft) >  because 

9(u{t+,  t+,  s~,  s~)+,  v{t+,  t+,  s~,  a-)-) 
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is  true,  we  know  that  the  second  disjunct, 

9(v(t+,  t+>  0+>  «(*+,  t+,  s~,  «")“). 

is  true. 

In  each  case,  we  have  shown  that  the  desired  conclusion  is  true. 


REPLACEMENT  WITH  RELATION  MATCHING:  GENERAL  VERSION 

The  general  version  of  the  rule  allows  us  to  instantiate  the  variables  of  the  given  sentences  as  necessary 
and  then  to  apply  the  ground  version.  We  omit  the  precise  statement,  which  is  analogous  to  the  general 
version  of  the  relation  replacement  rule,  but  we  illustrate  the  general  version  with  an  example  extracted 
from  the  derivation  of  a  program  to  sort  a  list  of  numbers. 

Example 

In  a  theory  of  lists  of  (say)  integers,  suppose  our  deduced  set  contains  the  sentences 

=  perm(x i  n  x2,  yi  □  y2)j 

and 

Q  :  not  (ordered(z)  and  perm(l+ ,  z)  ). 

Here  the  term  x\  □  x2  is  the  result  of  appending  the  lists  Xi  and  rc2,  and  the  term  (u)  is  the  list  whose  sole 
element  is  u.  Also,  perm(£,  z)  holds  if  the  list  £  is  a  permutation  of  the  list  z,  and  ordered(z)  holds  if  the 
elements  of  z  are  in  (weakly)  increasing  order.  In  the  derivation,  7  is  one  of  the  axioms  for  the  permutation 
relation,  which  states  that  two  lists  tire  permutations  if  they  are  still  permutations  after  dropping  a  common 
element,  and  Q  is  the  negation  of  the  theorem,  which  states  the  existence  of  an  ordered  list  that  is  a 
permutation  of  a  given  list. 

The  results  of  applying  the  substitution 

O'.  {z  <-  yi  °((«)  °y2)} 

to  the  boxed  subsentences  are 

perm((xi  □  ((u)  □  x2))+ ,  yi  a  «tt  )  □  y2)) 

and 

perm(£+,  yi  o  ((u)  a  y2)). 

The  mismatched  subterms 

xi  □  ((u)  n  x2)  and  £ 

are  positive  in  their  respective  subsentences  with  respect  to  the  perm  relation.  (Because  this  relation  is 
symmetric,  they  also  happen  to  be  negative.)  The  boxed  subsentence  perm(£,  z)  is  positive  in  $  with  respect 
to  the  equivalence  relation  =.  (It  also  happens  to  be  negative.)  Therefore,  by  the  =-replacement  rule  with 
perm-matching,  we  may  deduce  the  sentence 

if  perm(xi  □  ((u)  n  x2),l) 
then  false 
or 

not  (ordered(yi  □  ((«)  □  y2))  and  perm(xi  a  x2,  yi  □  y2)) 


perm(£+,  z) 


perm(xi  □  ((u)  □  x2),  yi  □  ((ti)  □  y2)) 
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which  reduces  under  transformation  to 
if  perm(xi  □  ((tt)  □  X2),  £) 

then  not  (ordered(y%  a  ((u)  a  y2))  and  perm[xx  □  x2,  J/i  °  1/2) )  -  j 


RELATION  MATCHING  VERSUS  RELATION  REPLACEMENT 


The  relation  matching  and  relation  replacement  rules  play  complementary  roles,  and  one  might  expect 
that  a  single  deductive  system  would  employ  one  or  the  other  rule  but  not  both.  After  all,  in  clausal  equality 
systems,  paramodulation  and  a  variant  of  E-resolution  have  each  been  shown  to  be  complete  (Anderson  [70], 
Digricoli  [83],  and  Brand  [75])  without  including  the  other.  Moreover,  by  incorporating  both  rules,  we  admit 
a  troublesome  redundancy:  The  same  conclusion  can  be  derived  in  several  ways. 


On  the  other  hand,  it  often  turns  out  that  a  proof  that  seems  unmotivated  or  tricky  using  only  one  of 
the  rules  seems  more  straightforward  using  a  combination  of  both.  For  instance,  in  an  example  of  a  previous 
section,  we  applied  the  resolution  rule  with  relation  matching  to  the  sentences 


7: 


if  g(u) 
then 


p(uH 


and 


S'- 


p  (£+,/(£)  +  ) 


taking  the  substitution  to  be 


6  :  {u  <—  £}, 

to  obtain  after  transformation 


if  £  -<  /(£) 


then  notq(t). 


If  our  deduced  set  also  contains  the  sentence 

v  ■<  /(»)  |, 

we  can  further  deduce  (by  resolution)  the  sentence 
not  ?(£). 

Now  suppose  our  deductive  system  includes  the  relation  replacement  rule  but  not  the  relation-matching 
rule.  Then  to  deduce  the  same  conclusion  not  <j(£),  we  would  have  to  apply  the  relation  replacement  rule  to 
the  sentences 

Ef  -<  m 

and 

5 ■  notp{ 


p  >  m) 


to  obtain  (after  transformation) 
not  |  p (/(£),  }{t)) 
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We  could  then  obtain  the  same  conclusion  (not  <j(£))  by  resolution  applied  to  this  sentence  and  the  sentence 


if  q{u) 
then 


p{u,  u) 


Although  both  sequences  of  inference  lead  to  the  same  conclusion,  the  earlier  proof  seems  better  mo¬ 
tivated:  Each  step  is  based  on  matching  subexpressions  that  already  possess  a  high  degree  of  syntactic 
similarity.  In  contrast,  the  above  proof  seems  rather  gratuitous:  The  application  of  the  relation  replacement 
rule  is  based  on  matching  the  variable  v  with  the  constant  £.  There  is  no  reason  to  perform  this  step  except 
as  a  preparation  for  the  subsequent  resolution  step. 


Examples  can  also  be  exhibited  for  which  a  proof  employing  the  replacement  rule  is  well-motivated  but 
the  corresponding  proof  using  the  matching  rule  appears  strained.  For  instance,  in  the  theory  of  integers, 
use  of  the  =-replacement  rule  and  the  axiom  u  +  (— u)  =  0  allows  us  to  simplify  a  subterm  of  form  t  +  (— t) 
to  0.  If  we  are  only  permitted  to  use  the  relation-matching  rule,  we  must  leave  the  subterm  intact,  and  hope 
that  we  attempt  to  match  it  against  a  corresponding  subterm  0  later  in  the  proof. 


We  expect  that  by  including  both  rules  together  in  a  system  we  shall  be  able  to  apply  more  restrictive 
strategies  to  each  of  them.  Consequently,  we  shall  obtain  a  smaller  search  space  than  if  we  had  included 
either  of  the  rules  separately. 


7.  STRENGTHENING 


The  relation  replacement  rule  of  Section  5  does  not  always  allow  us  to  draw  the  strongest  possible 
conclusion.  In  this  section  we  establish  a  stronger  form  of  the  polarity  replacement  lemma  and  use  it  to 
develop  a  stronger  relation-replacement  rule. 

We  motivate  the  strengthening  of  the  rule  with  an  example.  In  the  theory  of  the  integers,  suppose  our 
deduced  set  contains  the  sentences 

7  :  [TJ  <  t 

and 

5  •  a  <  R+  +  2- 

Because  the  occurrence  of  s  in  Q  is  positive  with  respect  to  the  less-than  relation  <,  the<-replacement  rule 
allows  us  to  replace  s  with  t  and  deduce  that  (after  transformation) 

a  <  t+  +  2. 

From  these  two  sentences,  however,  we  should  be  able  to  deduce  the  stronger  result 
a  <  t  +  2. 

Similarly,  from  the  sentence  s  <  t  and  not  (a  —  s  >  b),  we  should  be  able  to  deduce  not  (a  —  t  >  b)  rather 
than  merely  not  (a  —  t  >  b). 

Unfortunately,  the  rule  as  we  have  presented  it  does  not  yield  these  more  useful  conclusions;  the  strength¬ 
ened  relation-replacement  rule  will.  But  first,  we  must  introduce  some  preliminary  notions. 


THE  STRENGTHENED  POLARITY-REPLACEMENT  LEMMA 


The  strengthened  rule  depends  on  the  following  basic  result: 
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Lemma  (strengthened  polarity  replacement) 

Consider  arbitrary  expressions  e(x,y)  and  e'(x,y)  and  binary  relations  Hi  and  H2.  The  sentence 
if  x  Hi  y 

then  if  e(x,y)  -<2  e'(x,  y) 
then  e(y,x)  <2  e'(y,x) 

is  valid  provided  that  the  replaced  occurrences  of  x  and  y  satisfy  the  following  strengthening  con¬ 
ditions  [in  e{x,  y)  and  e'(x,y)  with  respect  to  Hi  and  -<2]: 

•  transitivity  condition 

The  relation  -<2,  the  irreflexive  restriction  of  -<2,  is  transitive. 

•  top  condition 

The  replaced  occurrences  of  x  and  y  are  respectively  positive  and  negative  in  e(x,y)  H2 
e'{x,y)  with  respect  to  Hi. 

•  left-right  condition 

One  of  the  following  two  disjuncts  holds: 

The  replaced  occurrences  of  x  and  y  in  e(x,  y)  are  respectively  negative  and  positive  in 
e(x,  y)  with  respect  to  Hx  and  H2  (and  some  replacement  is  made  in  e(x,  y)) 

( left  disjunct ) 


or 

the  replaced  occurrences  of  x  and  y  in  e'(x,y)  are  respectively  positive  and  negative  in 
e'(x,y)  with  respect  to  Hi  and  H2  (and  some  replacement  is  made  in  e'(x,y)). 

( right  disjunct) 


Before  proving  this  proposition,  let  us  illustrate  it  with  an  example. 


Example  (strengthened  polarity-replacement  lemma) 

In  a  theory  that  includes  the  sets  and  the  nonnegative  integers,  take  Hi  to  be  the  proper-subset  relation 
C  over  the  sets  and  H2  to  be  the  weak  less-than  relation  <  over  the  nonnegative  integers.  Then  H2  is  the 
strict  less-than  relation  <. 

Consider  the  sentence 

m  •  card(y )  <  n  +  card{x), 

where  x  and  y  are  sets,  m  and  n  are  nonnegative  integers,  and  card(x)  is  the  cardinality  of  the  set  x. 
According  to  the  lemma,  the  sentence 

if  x  <Z  y 

then  if  m  ■  card(y)  <  n  +  card(x) 
then  m  •  card(x)  <  n  +  card(y) 

is  valid,  because  the  replaced  occurrences  of  x  and  y  satisfy  the  strengthening  conditions  in  m  ■  card(y)  and 
n  +  card(x)  with  respect  to  C  and  <.  In  particular, 

•  The  relation  <  is  transitive;  hence  the  transitivity  condition  is  satisfied. 
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•  The  replaced  occurrences  of  x  and  y  are  respectively  positive  and  negative  in  m  ■  card(y)  < 
n  +  card(x)  with  respect  to  C;  hence  the  top  condition  is  satisfied. 

•  Although  the  replaced  occurrence  of  y  is  not  positive  in  m  ■  card(y)  with  respect  to  c 
and  <  (after  all,  m  could  be  0),  the  replaced  occurrence  of  x  is  positive  in  n  +  card(x) 
with  respect  to  c  and  <.  Hence,  though  the  left  disjunct  of  the  left-right  condition  is  not 
satisfied,  the  right  disjunct  is. 

We  are  now  ready  to  establish  the  lemma. 

Proof  (strengthened  polarity-replacement  lemma) 

Suppose  that 

x-<i  y  and  e(x,y)  H2  e'(x,y), 
and  that  the  strengthening  conditions  are  satisfied. 

We  would  like  to  show  that  then 
e(y,x)  <2  e'{y,x). 

The  left-right  condition  was  stated  as  a  disjunction  of  two  possibilities;  we  treat  each  possibility  sepa¬ 
rately. 

Case  ( left  disjunct):  The  replaced  occurrences  of  x  and  y  in  e{x,  y)  are  respectively  negative  and 
positive  in  e(x,  y)  with  respect  to  Hi  and  X2  (and  some  replacement  is  made  in  e(x,  y)). 

In  this  case  (by  the  transitive  polarity-replacement  lemma,  because  x  Hi  y),  we  have 

e(y,x)  -<2  e(x,y). 

Also  (by  the  polarity  replacement  proposition  and  our  supposition  that  x  Hi  y  and  e(x,  y)  H2  e'(x,  y)) 
we  have 


e(x,  y)  H2  e'(y,  x). 

(Here  we  have  only  performed  the  replacements  on  the  right-hand  side;  by  the  top  condition,  we  know  the 
replaced  occurrences  of  x  and  y  are  respectively  positive  and  negative  in  e(x,y)  H2  e'(r,  y)  with  respect  to 
Hi.)  It  follows  that 

e(*i  V )  <2  e'(y,  x)  or  e{x ,  y)  =  e'(y,  x). 

Because  e(y,  x)  X2  e{x,  y),  we  thus  have  (either  by  the  transitivity  of  X2  or  the  substitutivity  of  equality) 

that 


e(y,x)  X2  e'{y,  x), 
as  we  wanted  to  show. 

Case  ( right  disjunct):  The  replaced  occurrences  of  x  and  y  in  e'(x,y)  are  respectively  positive  and 
negative  in  e'(x,y)  with  respect  to  Hi  and  X2  (and  some  replacement  is  made  in  e(x,y)). 

The  proof  in  this  case  is  entirely  symmetric  to  the  proof  in  the  previous  case. 


THE  STRENGTHENED  POLARITY-REPLACEMENT  PROPOSITION 


The  strengthened  rule  is  expressed  in  terms  of  the  following  notational  device: 
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Definition  (strengthen  accordingly) 

Suppose  -4  is  a  binary  relation,  s  and  t  are  expressions  (either  both  sentences  or  both  terms),  and 
Q  is  a  sentence. 

If  we  write  §  as  Q{s+ ,  t~),  then  $(t+,  s“)T  denotes  the  sentence  obtained  by  replacing  certain 
positive  occurrences  of  s  with  t,  replacing  certain  negative  occurrences  of  t  with  s  (where  polarity 
is  taken  with  respect  to  -<),  and  strengthening  accordingly  as  follows: 

•  Whenever  a  replacement  is  made  in  a  positive  subsentence  of  form  e{s,  t)-i  e'(s,t),  where 
the  replaced  occurrences  of  s  and  t  satisfy  the  strengthening  conditions  in  e(s,  t)  and  e'(s,  t) 
with  respect  to  -4  and  -4,  replace  the  occurrence  of  the  symbol  -4  with  -<,  the  irreflexive 
restriction  of  -4. 

•  Whenever  a  replacement  is  made  in  a  negative  subsentence  of  form  e(s,  t)3  e'(s,  t),  where 
the  replaced  occurrences  of  s  and  t  satisfy  the  strengthening  conditions  in  e(s,  t)  and  e'(s,  t) 
with  respect  to  -4  and  ■/{,  replace  the  occurrence  of  the  symbol  -4  with  (Here  -ft  and  ^ 
are  the  negation,  and  the  reflexive  closure,  respectively,  of  -<.) 


These  conditions  may  appear  mysterious  at  this  point,  but  they  are  precisely  what  we  need  to  establish 
the  following  result,  which  tightens  up  the  polarity  replacement  proposition: 


Proposition  (strengthened  polarity  replacement) 

For  any  binary  relation  -4  and  sentence  P(x+,  y~),  the  sentence 
if  x  -4  y 

then  if  P(x+,  y~) 

then  P(y+,  i~)T 

is  valid. 


We  illustrate  the  proposition  with  two  examples. 


Example 

In  the  theory  of  the  positive  integers  (excluding  0),  take  -4  to  be  the  proper-divides  relation  <div  and 
take  our  sentence  to  be 

P(x+,y~):  a  <  (s  +  l)2  or  q{x). 

Then  according  to  the  proposition,  the  sentence 

if  %  ~^div  y 

then  if  a  <  {x  +  l)2  or  q[x) 
then  a  <  (y  +  l)2  or  q(x) 

is  valid.  Note  that  the  symbol  <  has  been  replaced  by  its  irreflexive  restriction  <  as  a  result  of  the  strength¬ 
ening.  This  is  because 

•  The  subsentence  a  <  (x  +  l)2  is  positive  in  P(x+,  y~). 

•  The  replaced  occurrence  of  x  in  a  <  (x  +  l)2  satisfies  the  strengthening  conditions  in  a 
and  ( x  +  l)2  with  respect  to  -<div  and  <.  In  particular 

■  The  relation  <  is  transitive;  hence  the  transitivity  condition  is  satisfied. 
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■  The  replaced  occurrence  of  x  is  positive  in  a  <  (x  +  l)2  with  respect  to  -<d,„; 
hence  the  top  condition  is  satisfied. 

■  The  replaced  occurrence  of  x  is  positive  in  [x  +  l)2  with  respect  to  <div  and  <; 
hence  the  right  disjunct  of  the  left-right  condition  is  satisfied. 


Example 

In  a  theory  that  includes  the  lists  and  the  nonnegative  integers,  take  H  to  be  the  tail  relation  <taii  over 
the  lists  and  take  our  sentence  to  be 

P(x+,  y~)  :  if  length(xnl)  <  length[y)  +m  then  q(x,y), 

where  x,  y,  and  £  are  lists,  mis  a  nonnegative  integer,  and  length(i)  is  the  number  of  elements  in  the  list  £. 
Then  according  to  the  proposition,  the  sentence 

if  x  y 

then  if  if  length(x  □  £)  <  length(y)  +  m  then  q(x,y) 
then  if  length(y  a  £)  <  length(x)  +  m  then  q(x,  y) 

is  valid.  Note  that  here  the  symbol  <  has  been  replaced  by  <  as  a  result  of  the  strengthening.  This  is 
because 

•  The  subsentence  length[x  □  £)  <  length{y)  +  m  is  negative  in  P( x+ ,  y~). 

•  The  replaced  occurrences  of  x  and  y  satisfy  the  strengthening  conditions  in  length[x  □  £) 
and  length(y)  +  m  with  respect  to  and  yi,  that  is  >.  In  particular 

■  The  relation  >,  the  irreflexive  restriction  of  >,  is  transitive;  hence  the  transitivity 
condition  is  satisfied. 

■  The  replaced  occurrences  of  x  and  y  are  positive  and  negative,  respectively,  in 
the  sentence  length[x  a  £)  >  length{y)  +  m  with  respect  to  ;  hence  the  top 
condition  is  satisfied. 

■  The  replaced  occurrence  of  x  is  negative  in  length[x  □  £)  with  respect  to  <taii 
and  >;  hence  the  left  disjunct  of  the  left-right  condition  is  satisfied.  (As  it  turns 
out,  the  replaced  occurrence  of  y  is  also  negative  in  length(y)  +  m  with  respect 
to  -<taii  and  >;  hence  the  right  disjunct  is  also  satisfied.) 


Let  us  now  prove  the  proposition. 


Proof  (strengthened  polarity-replacement  proposition) 

We  suppose  that 

x -<  y  and  P(x+,y~), 
and  show  that  then 

P(y+,  s~)T- 

The  sentence  P{y+ ,  s_)T  is  obtained  from  P{x+,  y~)  by  replacing  certain  subexpressions  with  others.  We 
show  that  each  of  these  replacements  makes  the  sentence  “truer,”  in  the  sense  that  it  produces  a  sentence 
implied  by  the  original. 

We  consider  separately  three  kinds  of  replacement: 
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•  Replacing  a  positive  subsentence  of  form  e{x,  y)3  e'(x,  y)  with  e(y,  x)<  e'{y,  x),  where  the  replaced 
occurrences  of  x  and  y  satisfy  the  strengthening  conditions  in  e(x,  y)  and  e'(x,  y)  with  respect  to  -4 
and  3. 

In  this  case,  because  x  -4  y,  we  have  (by  the  strengthened  polarity-replacement  lemma)  that 

if  e(x,y)  3  e'{x,y) 
then  e(y,x)  <e'(y,x). 

Therefore,  because  the  replaced  occurrence  of  e(x,y)^4  e'(x,y)  is  positive  in  P(x+,  y~),  we  know  (by  the 
original  polarity-replacement  proposition)  that  replacing  it  with  the  “truer”  subsentence  e(y,x)<  e'(y,x) 
makes  the  entire  sentence  truer. 

•  Replacing  a  negative  subsentence  of  form  e(x,  y)3  e'(x,  y),  with  e(y,  x)d  e'(y,  x),  where  the  replaced 
occurrences  of  x  and  y  satisfy  the  strengthening  conditions  in  e{x,  y)  and  e'(x,  y)  with  respect  to  -4 
and  -/k  (the  negation  of  3). 

A/ 

In  this  case,  because  x  -4  y,  we  have  (by  the  strengthened  polarity-replacement  lemma,  recalling  that  d 
is  the  irreflexive  restriction  of  jk) 

if  e(x,y)%  e'(x,y) 
then  e(y,  x)d  e'(y,x) 

or,  equivalently  (taking  the  contrapositive), 

if  e(y>  #)  d  e'(y,x) 
then  e(x ,  y)  e'(x,y). 

Therefore,  because  the  replaced  occurrence  of  e(x,y)^4  e'(x,y)  is  negative  in  P{x+,  y~),  we  know  (by  the 
original  polarity-replacement  proposition)  that  replacing  it  with  the  “falser”  sentence  e(y,x)de'{y,x)  will 
make  the  entire  sentence  falser. 

•  Replacing  a  positive  occurrence  of  x  with  y  or  a  negative  occurrence  of  y  with  x,  where  polarity  is 
with  respect  to  -4  and  where  the  replaced  occurrence  is  not  within  the  scope  of  any  strengthened 
relation  3. 

In  this  case,  the  replacement  makes  the  sentence  “truer,”  by  the  original  polarity-replacement  proposi¬ 
tion. 


THE  GROUND  VERSION 

We  can  now  express  the  stronger  version  of  the  relation  replacement  rule.  The  ground  version  of  the 
rule  is  as  follows: 

Rule  (strengthened  relation  replacement,  ground  version) 

For  any  binary  relation  -4,  ground  expressions  s  and  t,  and  ground  sentences  7[s  -<  i]  and  Q{s+ ,  f-), 
we  have 

7\s  -4  t] 

9(s+,  *~) 

7 [false]  or  $(t+,  s~)1 
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Here  Q(t+ ,  s“)t  is  the  result  of  replacing  certain  positive  occurrences  of  s  with  t,  replacing  certain 
negative  occurrences  of  t  with  s,  and  strengthening  accordingly,  where  polarity  is  taken  in  ,  t~) 
with  respect  to  We  assume  that  at  least  one  replacement  is  made. 

Let  us  illustrate  the  ground  version  of  the  rule  with  two  examples. 

Example 

In  the  theory  of  the  positive  integers  (excluding  0),  suppose  our  deduced  set  contains  the  sentences 
7  :  if  p(s)  then  jTJ  <div  t 

and 

5  :  «  <  (|  8  | '  +  l)2  or  g(s), 

where  -<*•„  is  the  proper  divides  relation.  Then  we  can  apply  the  strengthened  -<d,„-replacement  rule  to 
replace  the  boxed  occurrence  of  s  in  Q  with  t  and  to  strengthen  accordingly,  obtaining 

if  p[s)  then  false 
or 

a  <  (t+  l)2  or  g(s). 

This  sentence  reduces  under  transformation  to 
(notp(s))  or  a  <  (f  +  l)2  or  g(s). 

The  relation  symbol  <  was  replaced  by  its  irreflexive  restriction  <  because  a  <  (s  +  l)2  is  positive  and 
because  s  and  t  satisfy  the  strengthening  conditions  in  a  and  (s  +  l)2  with  respect  to  <div  and  <,  as  we 
have  seen  in  a  previous  example. 


Example 

In  a  theory  that  includes  the  sets  and  the  nonnegative  integers,  suppose  our  deduced  set  contains  the 
sentences 


7  :  p{s,t)  or  [7]  c[T| 


$:  not  (q{s,t)  and  m  ■  card( )<  n  +  card(  £tj"  , 


where  s  and  t  are  sets,  m  and  n  are  nonnegative  integers,  and  card(s)  is  the  cardinality  of  the  set  s.  Then 
we  can  apply  the  strengthened  C-replacement  rule  to  replace  the  boxed  occurrences  of  s  with  t  and  t  with 
s  and  to  strengthen  accordingly,  obtaining 

p(s,t)  or  false 


or 

not  (q(s,  t)  and  m  ■  card[t)  <  n  +  card(s)  j  , 


that  is  (after  transformation), 


p(s,t)  or 

not  (q(s,  t)  and  m  ■  card{t)  <  n  +  card(s)^  . 
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The  relation  symbol  <  has  been  replaced  by  its  reflexive  closure  <  because  m  ■  card(s)  <  n  +  card(t)  is 
negative  and  because  s  and  t  satisfy  the  strengthening  conditions  in  m'card(s)  and  n  +  card{t)  with  respect 
to  C  and  ft,  that  is,  >.  In  particular, 


•  The  irreflexive  restriction  >  of  >  is  transitive;  hence  the  transitivity  condition  is  satisfied. 

•  The  replaced  occurrences  of  s  and  t  are  respectively  positive  and  negative  in  m  •  card(s)  < 
n  +  card(t)  with  respect  to  C  and  >;  hence  the  top  condition  is  satisfied. 

•  The  replaced  occurrence  of  t  is  negative  in  n  + car d[t)  with  respect  to  C  and  >;  hence  the 
right  disjunct  of  the  left-right  condition  is  satisfied. 


Let  us  now  establish  the  soundness  of  the  rule. 


Justification  (relation  replacement  rule,  ground  version) 

The  proof  resembles  the  justification  of  the  original  relation-replacement  rule. 

We  suppose  that  the  given  sentences  7[s  -4  f]  and  Q(s+ ,  t~)  are  true  and  show  that  the  newly  deduced 
sentence  [false]  or  ${t+ ,  a~)f)  is  also  true.  We  distinguish  between  two  cases  and  show  that  in  each  case 
one  of  the  two  disjuncts,  7 [false]  or  Q{t+ ,  s~Y ,  is  true. 

In  the  case  in  which  the  subsentence  s  -4  t  is  false,  we  know  (by  the  value  property,  because  s  -4  t  and 
false  have  the  same  truth  value  and  7[s  -4  t]  is  true)  that  the  first  of  the  disjuncts,  7[false],  is  true. 

In  the  case  in  which  s  -4  t  is  true,  we  know  (by  the  strengthened  polarity-replacement  proposition, 
because  @(s+,  t~)  is  true)  that  the  second  of  the  disjuncts,  5{t+ ,  s“)^,  is  true. 


THE  GENERAL  VERSION 

The  general  version  of  the  rule  allows  us  to  instantiate  the  variables  of  the  sentences  as  necessary  to 
create  common  subexpressions. 


Rule  (strengthened  relation  replacement,  general  version) 

For  any  binary  relation  -4,  expressions  s,  t,  s,  and  t,  and  sentences  7\s  -4  f]  and  Q{s+ ,  t+),  where 
7  and  Q  are  standardized  apart,  we  have 

7\s  -4  t] 

$(s+,  t ") 

78[false]  or  Q6{td+,  sd~)^ , 

where  8  is  a  simultaneous,  most-general  unifier  of  s,  s  and  of  t,  t. 


As  usual,  to  apply  the  general  version  of  the  rule  to  sentences  7  and  Q,  we  apply  its  ground  version  to  78 
and  Q 8 .  The  justification,  which  is  straightforward,  is  omitted.  As  before,  the  polarity  strategy  for  the  rule 
allows  us  to  assume  that  a  least  one  occurrence  of  the  subsentence  (s  ~4  t)8  is  positive  or  of  no  polarity  in 


78. 
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8.  EXTENSIONS 


The  concepts  in  this  paper  are  being  extended  in  several  directions.  We  briefly  indicate  several  of  these 


here. 


EXPLICIT  QUANTIFIERS 

The  system  we  have  described  deals  with  sentences  that  have  had  their  quantifiers  removed  by  skolem- 
ization.  It  is  impossible,  however,  to  remove  quantifiers  that  occur  within  the  scope  of  an  equivalence  (=) 
connective  or  in  the  i/-clause  of  a  conditional  ( if-then-else )  connective  without  first  paraphrasing  the  con¬ 
nective  in  terms  of  others.  If  several  of  these  connectives  are  nested,  the  paraphrased  sentence  becomes 
alarmingly  complex. 

In  an  earlier  work  (Manna  and  Waldinger  [82]),  we  extend  the  deductive  system  to  sentences  that 
may  have  some  of  their  quantifiers  intact.  In  many  cases,  we  can  complete  the  proof  without  removing  all 
the  quantifiers.  If  these  quantifiers  are  in  equivalences  or  t/-clauses,  we  need  not  paraphrase  the  offending 
connectives.  Thus,  we  not  only  retain  the  form  of  the  original  sentence,  but  also  can  use  the  equivalences 
we  retain  in  applying  the  equivalence  replacement  rule. 


POLARITY  WITH  RESPECT  TO  AN  EXPRESSION 

We  have  used  the  notion  of  polarity  with  respect  to  a  relation.  Because  a  function  is  a  special  case  of 
a  relation,  we  can  define  polarity  with  respect  to  a  function  accordingly.  Rather  than  restricting  ourselves 
to  the  functions  denoted  by  the  function  symbols  in  our  deduced  set,  we  prefer  to  consider  the  functions 
corresponding  to  particular  expressions  in  the  set. 

Roughly  speaking,  suppose  e[s]  is  a  ground  term;  then  e[s]  corresponds  to  a  binary  relation  -<e[»]  defined 
by  the  sentence 

x  -4e\»]  y  =  eM  =  y- 

We  may  define  polarity  with  respect  to  -«<,[,]  just  as  we  would  with  respect  to  any  binary  relation. 

For  example,  in  the  theory  of  the  integers,  the  relation  He[4j  corresponding  to  the  term  e[s]  :  s  +  1  is 
defined  by  the  sentence 

X  -<e[«]  y  =  x+1  =  y. 

(In  fact,  this  relation  turns  out  to  be  the  predecessor  relation  -<pred  we  have  seen  earlier.)  The  relation 
natnum{ x),  which  holds  if  x  is  a  nonnegative  integer  (natural  number),  is  positive  over  its  argument  with 
respect  to  -<e[4] ,  for  we  have 

if  x  -te[<!  y 
then  if  natnum(x) 

then  natnum(y). 

We  can  then  establish  an  expression  replacement  rule  analogous  to  our  relation  replacement  rule;  i.e., 
in  the  ground  version: 

For  any  expressions  s  and  e[s]  and  ground  sentence  $(s+,  e[s]— ) ,  we  have 

g(s+,  e[s]-) 


9(e[s}+,  s~) T 
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Here  £(e[a]+,  a-)*  is  obtained  from  £(s+,  e[a]-)  by  replacing  certain  positive  occurrences  of  a  with  e[s], 
replacing  certain  negative  occurrences  of  e[a]  with  a,  and  strengthening  accordingly,  where  polarity  is  taken 
in  p(a+,  e[s]~)  with  respect  to 

For  example,  in  the  theory  of  the  integers,  if  our  deduced  set  contains  the  sentence 
Q  :  not  [nafnti7n((s  +  l)~)] 
we  may  deduce  the  sentence 
not  [natnum(s)] , 

because  the  occurrence  of  a  + 1  is  negative  in  Q  with  respect  to  the  relation  corresponding  to  the  expression 

a  +  l. 


We  can  also  define  expression-matching  rules  analogous  to  our  relation-matching  rule. 


For  example,  in  the  theory  of  lists,  suppose  our  deduced  set  contains  the  sentences 


fi 


a  e  a+ 


and 


s-- 


not  ( 


a  6.  (bo  s)+  ). 


Here  the  term  bos  is  the  result  of  inserting  the  element  b  before  the  first  element  of  the  list  a.  By  the  resolution 
rule  with  expression  matching,  whose  precise  statement  we  omit,  we  may  deduce  (after  transformation),  the 
contradiction  false,  because  s  is  positive  in  the  boxed  sentence  o  6  a  with  respect  to  the  relation  corresponding 
to  6  o  s. 


CONDITIONAL  POLARITY 


Sometimes  it  is  convenient  to  extend  the  notion  of  polarity  to  depend  on  the  truth  of  certain  conditions. 
For  example,  in  the  theory  of  integers  (including  negative  integers)  with  respect  to  the  relation  <,  the 
occurrence  of  a  in  the  sentence 


a  <  b  ■  a 


might  be  regarded  as  positive  if  6  is  nonnegative  and  negative  if  b  is  nonpositive.  (If  b  is  0,  the  occurrence 
might  have  both  polarities).  We  could  then  adapt  the  relation  replacement  and  relation  matching  rules  to 
use  this  conditional  polarity,  imposing  the  appropriate  conditions  on  whatever  conclusion  they  draw. 


More  precisely,  we  define  the  notion  of  conditional  polarity  so  that  if  x  and  y  are  respectively  positive 
and  negative  in  P(x+,  y~)  with  respect  to  the  binary  relation  -<  subject  to  the  condition  P[x,y,  Q],  then  the 
sentence 


x, 


y, 


if  x  •<  y 

then  if  P(x+,  y~) 

then  P(y+,  x~)* 


is  valid.  Here  Q  denotes  an  arbitrary  sentence;  the  indicated  polarities  of  the  replaced  occurrences  of  x  and 
y  are  subject  to  the  condition  P\x,  y,  Q\. 


For  example,  according  to  this  notion  of  conditional  polarity,  in  the  theory  of  the  integers,  the  occurrence 
of  x  in  the  sentence 


a  <  b  +  x2 
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is  positive  with  respect  to  the  relation  <  subject  to  the  condition 

X{x,  y,  Q] : 

Consequently,  we  have  that  the  sentence 


if  x>0 
then  Q. 


if  x>0 
then  if  x  <  y 

then  if  a  <  6  +  x2 

then  a  <  b  +  J/2 

is  valid.  The  relation  <  was  replaced  by  <  as  the  result  of  strengthening. 

In  terms  of  this  notion,  we  can  introduce  conditional  versions  of  the  relation  replacement  rule  and 
relation-matching  rules.  In  particular,  we  have  the  conditional  relation-replacement  rule,  i.e.,  in  the  ground 
version: 


For  any  binary  relation  ground  expressions  s  and  t,  and  ground  sentences  7\s  -<  t]  and  £(s+,  t  ), 
we  have 

7\s  -i  t\ 

9{a+ ,  O 

H[s,t,  false]  or  7[false]  or  ${t+,  s_)T. 

Here  the  indicated  polarities  of  the  replaced  occurrences  of  s  and  t  are  subject  to  the  condition 

n\s,t,o]. 

For  example,  in  the  theory  of  the  integers,  suppose  our  deduced  set  contains  the  sentences 

j  .  */  r(s.  0 

then  s  <  t 

and 

Q  :  a  <  b  ■  s. 

Note  that  the  occurrence  of  s  in  Q  is  positive  with  respect  to  the  relation  <  subject  to  the  condition 

if  b>  0 
then  Q. 

Therefore,  according  to  the  conditional  <-replacement  rule,  we  may  deduce 


'if  b>  0 

/if 

if  r(s,  t) 

then  false 

U1 

then  false 

which  reduces  under  transformation  to 

(not  (6  >  0))  or  (not  (r(s,  t)))  or  a  <b-t. 

The  conditional  relation-matching  rules  are  analogous.  Of  course  these  rules  can  be  extended  to  apply 
to  conditional  polarity  with  respect  to  an  expression  rather  than  a  relation. 


PLANNING  AND  THE  FRAME  PROBLEM 

Theorem-proving  techniques  have  often  been  applied  to  problems  in  automatic  planning.  One  approach 
to  this  application  has  been  the  formulation  of  a  situational  logic,  a  theory  in  which  states  of  the  world  are 
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regarded  as  domain  elements,  denoted  by  terms.  Typically,  an  action  in  a  plan  is  represented  as  a  function 
mapping  states  into  other  states.  The  effects  of  an  action  can  be  described  by  axioms. 

For  example,  the  primary  effect  of  putting  one  block  on  top  of  another  is  expressed  by  an  axiom  such  as 

if  clear(x,  tv)  and  clear(y,  to) 
then  on[x,y,puton[x,y,  to)). 

In  other  words,  if  block  x  is  put  on  block  y  in  a  state  to,  then  x  will  indeed  be  on  y  in  the  resulting  state 
puton[x,y,  tv).  The  antecedent  expresses  the  preconditions  that  x  and  y  be  clear  before  x  can  be  put  on  y; 
in  other  words,  no  block  can  be  on  x  or  on  y.  (The  conventional  blocks-world  hand  can  move  only  one  block 
at  a  time.) 

In  a  situational  logic,  a  problem  may  be  expressed  as  a  theorem  to  be  proved.  For  example,  the  problem 
of  achieving  the  condition  that  block  a  is  on  block  b  and  block  b  is  on  block  c  might  be  phrased  as  the 
theorem 

[3z)[on[a,b,z)  and  on[b,c,z)]. 

The  frame  problem,  which  occurs  when  planning  problems  are  approached  in  this  way,  is  connected  with 
the  requirement  that  we  need  to  express  not  only  what  conditions  are  altered  by  a  given  action,  but  also 
what  conditions  are  unchanged.  For  example,  in  addition  to  the  primary  effect  of  putting  one  block  on  top 
of  another,  we  must  state  explicitly  that  this  action  has  no  effect  on  other  relations,  such  as  color;  otherwise, 
we  shall  have  no  way  of  deducing  that  the  color  of  a  block  after  the  action  is  the  same  as  its  color  before. 
Therefore,  we  must  include  in  our  deduced  set  the  frame  axiom 

if  clear[x,  tv)  and  clear(y,  to) 
then  if  color[z,  u,  tv) 

then  color[z,u,puton[x,y,  tv)). 

In  other  words,  if  the  action  of  putting  block  x  on  top  of  block  y  is  legal  and  if  block  z  is  of  color  u  in  state 
tv,  then  z  will  also  be  of  color  u  in  the  resulting  state  puton[x,  y,  tv).  If  our  deduced  set  contains  the  sentence 

not  [color[c,  red,  puton[a,  b,  s))), 

we  can  then  apply  the  resolution  rule  to  the  frame  axiom  and  this  sentence  to  deduce  (after  transformation) 

[not  [clear[a,  s)))  or  [not  [clear [b,  s)))  or  [not  [color[c,  red,  s))). 

We  need  a  separate  frame  axiom  not  only  for  the  color  of  blocks,  but  also  their  size,  shape,  surface 
texture,  and  any  other  attributes  we  wish  to  discuss  in  our  theory.  Adding  all  the  frame  axioms  to  our 
deduced  set  aggravates  the  search  problem,  because  the  axioms  have  many  consequences  irrelevant  to  the 
problem  at  hand. 

By  use  of  the  conditional  expression  rules,  we  can  drop  all  the  frame  axioms  from  our  deduced  set. 
For  example,  to  paraphrase  the  above  axiom  we  can  declare  that  the  relation  color[z,  u,  tv)  is  positive  with 
respect  to  the  relation  corresponding  to  the  expression  e[to]  :  puton[x,  y,  tv)  subject  to  the  condition 

wr  ...i  q]  .  if  clear[x,  tv)  and  clear[y,  w) 

1  ’  ’  J  '  then  Q. 

If  our  deduced  set  again  contains  the  sentence 
not  [color  [c,  red,puton(a,  6,  s)-)), 

we  can  then  apply  the  conditional  expression-replacement  rule  to  deduce 

[not  [clear[a,  s)))  or  [not  [clear [b,  s)))  or  [not  [color [c,  red,  s))) 

as  before,  without  requiring  the  frame  axiom.  Of  course,  the  information  that  certain  actions  and  relations 
are  independent  must  still  be  expressed,  but  this  can  be  done  by  polarity  declarations  rather  than  by  axioms. 
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The  theorem-proving  system  we  have  presented  has  been  motivated  by  our  work  in  program  synthesis, 
and  the  best  examples  we  have  of  its  use  are  in  this  domain.  We  have  used  the  system  to  write  detailed 
derivations  for  programs  over  the  integers  and  real  numbers,  the  lists,  the  sets,  and  other  structures.  These 
derivations  are  concise  and  easy  to  follow:  they  reflect  intuitive  derivations  of  the  same  programs.  A  paper 
by  Traugott  [85]  describes  the  application  of  this  system  to  the  derivation  of  several  sorting  programs.  A 
paper  by  Manna  and  Waldinger  [85]  describes  the  derivation  of  several  binary-search  programs.  Our  earlier 
informal  derivation  of  the  unification  algorithm  (Manna  and  Waldinger  [81])  can  be  expressed  formally  in 
this  system. 

An  interactive  implementation  of  the  basic  nonclausal  theorem-proving  system  was  completed  by  Malachi 
and  has  been  extended  by  Bronstein  to  include  some  of  the  relation  rules.  An  entirely  automatic  imple¬ 
mentation  is  being  contemplated.  The  relation  rules  will  also  be  valuable  for  proving  purely  mathematical 
theorems.  For  this  purpose  they  may  be  incorporated  into  clausal  as  well  as  nonclausal  theorem-proving 
systems. 

Theorem  provers  have  exhibited  superhuman  abilities  in  limited  subject  domains,  but  seem  least  com¬ 
petent  in  areas  in  which  human  intuition  is  best  developed.  One  reason  for  this  is  that  an  axiomatic 
formalization  obscures  the  simplicity  of  the  subject  area;  facts  that  a  person  would  consider  too  obvious  to 
require  saying  in  an  intuitive  argument  must  be  stated  explicitly  and  dealt  with  in  the  corresponding  formal 
proof.  A  person  who  is  easily  able  to  conduct  the  argument  informally  may  well  be  unable  to  understand 
the  formal  proof,  let  alone  to  produce  it. 

Our  work  in  special  relations  is  part  of  a  continuing  effort  to  make  formal  theorem  proving  resemble 
intuitive  reasoning.  In  the  kind  of  system  we  envision,  proofs  are  shorter,  the  search  space  is  compressed, 
and  heuristics  based  on  human  intuition  become  applicable. 
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